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Nonlinear  piecewise  polynomial 
approximation  beyond  Besov  spaces  * 

Borislav  Karaivanov,  Pencho  Petrushev 


Abstract 

We  study  nonlinear  n-term  approximation  in  Lp(R2)  (0  <  p  <  oo)  from  Courant 
elements  or  (discontinuous)  piecewise  polynomials  generated  by  multilevel  nested  tri¬ 
angulations  of  R2  which  allow  arbitrarily  sharp  angles.  To  characterize  the  rate  of 
approximation  we  introduce  and  develop  three  families  of  smoothness  spaces  gener¬ 
ated  by  multilevel  nested  triangulations.  We  call  them  B-spaces  because  they  can  be 
viewed  as  generalizations  of  Besov  spaces.  We  use  the  B-spaces  to  prove  Jackson  and 
Bernstein  estimates  for  n-term  piecewise  polynomial  approximation  and  consequently 
characterize  the  corresponding  approximation  spaces  by  interpolation.  We  also  develop 
methods  for  n-term  piecewise  polynomial  approximation  which  capture  the  rates  of  the 
best  approximation. 


1  Introduction 

Nonlinear  approximation  from  piecewise  polynomials  and  splines  is  a  central  theme  in  nonlin¬ 
ear  approximation  theory.  The  ultimate  problem  is  to  characterize  the  rate  of  approximation 
in  terms  of  certain  smoothness  conditions.  In  the  univariate  case  and  in  the  regular  case 
in  cl  dimensions  (cl  >  1),  this  problem  has  found  a  completely  satisfactory  solution  involv¬ 
ing  a  certain  class  of  Besov  spaces  and  the  machinery  of  .Jackson-Bernstein  estimates  and 
interpolation  (see  [PI],  [DJP],  [DPY],  and  also  [BI]  and  [D]). 

Our  goal  in  this  article  is  to  study  nonlinear  approximation  from  piecewise  polynomials 
over  triangulations  consisting  of  n  pieces.  The  difficulty  of  this  problem  stems  from  the 
highly  nonlinear  nature  of  piecewise  polynomials  in  dimensions  cl  >  1.  For  instance,  if  S\ 
and  S2  are  two  piecewise  polynomials  over  two  distinct  triangulations  of  [0,  l]2  consisting  of 
n  pieces  each,  then,  in  general,  Si  +  S2  is  a  piecewise  polynomial  over  more  than  n2  triangles 
(in  the  univariate  case,  the  number  of  pieces  is  at  most  2 n).  This  makes  the  idea  of  using 
a  single  smoothness  space  scale  (like  Besov  spaces)  and  the  recipe  of  proving  Jackson  and 
Bernstein  estimates,  and  interpolation  (like  in  the  univariate  case)  hopeless. 

In  this  article,  we  take  a  different  approach  to  this  problem.  First  of  all,  we  modify 
the  problem  by  considering  nonlinear  n-term  approximation  from  piecewise  polynomials 
generated  by  multilevel  nested  triangulations  of  M2.  We  consider  two  types  of  such  n- 
term  approximation:  (a)  from  Courant  elements  (continuous  piecewise  linear  elements)  and 
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(b)  from  (discontinuous)  piecewise  polynomials  over  triangles.  More  precisely,  we  consider 
nested  triangulations  {Tm}m&  such  that  each  level  Tm  is  a  partition  of  K2  and  a  refinement  of 
the  previous  level  Tm-i,  and  define  T  :=  U mez7m-  Each  nested  triangulation  T  generates  a 
ladder  of  spaces  •  •  •  C  <S_i  C  S0  C  <Si  C  •  •  •  (Multiresolution  Analysis)  consisting  of  piecewise 
polynomials  of  a  certain  degree  over  the  corresponding  levels.  In  the  case  of  continuous 
piecewise  linear  functions,  Sm  ( m  G  Z)  is  spanned  by  Courant  elements  ipg  supported  on 
cells  9  at  the  m-th  level  Tm.  We  impose  some  natural  mild  conditions  on  the  triangulations 
in  order  to  prevent  them  from  possible  deterioration.  At  the  same  time,  these  conditions 
allow  the  triangles  from  T  to  have  arbitrarily  sharp  angles  and  a  lot  of  flexibility.  After  this 
preliminary  structuring,  we  consider  nonlinear  approximation  from  n-terrn  piecewise  linear 
functions  of  the  form  S  =  ]C”=|  ao  >fo  or  piecewise  polynomials  of  degree  <  k  of  the  form 
S  =  E U  1a,  '  p A,,  where  9j  and  A j  may  come  from  different  levels  and  locations  (1a 
denotes  the  characteristic  function  of  A).  Note  that  in  both  cases  we  have  n-term  nonlinear 
approximation  from  redundant  systems.  So,  by  introducing  such  a  multilevel  structure,  we 
make  the  problem  somewhat  more  accessible  and  simultaneously  preserve  a  great  deal  of 
flexibility. 

Although  the  approximation  problem  has  been  tamed  to  some  extent,  it  still  remains 
highly  nonlinear.  It  is  crystal  clear  to  us  that  such  highly  nonlinear  approximation  cannot 
be  governed  by  a  single  (super)  space  scale  like  the  Besov  spaces  in  the  univariate  case.  For 
instance,  it  is  well  known  that  in  presence  of  functions  supported  on  very  “skinny”  triangles 
or  long  and  narrow  regions  the  Besov  spaces  are  completely  unsuitable  and  hence  useless  (see 
§2.5  below).  Thus  the  second  important  concept  is  to  quantify  the  approximation  process 
by  using  a  family  of  smoothness  spaces,  say,  Ba(T )  depending  on  the  triangulations.  We 
called  them  B-spaces.  So,  the  idea  is  to  measure  the  smoothness  of  the  functions  from  a 
family  (library)  of  space  scales  {Ba(T)}r  instead  of  a  single  smoothness  space  scale. 

The  third  important  issue  in  our  theory  is  the  way  we  represent  the  functions.  On  the 
one  hand,  all  Courant  elements  as  well  as  all  polynomials  restricted  to  triangles  generated 
by  a  nested  triangulation  form  redundant  systems.  On  the  other  hand,  there  are  no  good 
bases  available  which  consist  of  piecewise  polynomials  over  general  triangulations.  On  top 
of  this,  we  want  to  approximate  in  Lp(R 2),  0  <  p  <  oo.  There  is,  however,  a  good  and 
well-known  means  of  representing  functions  by  using  suitable  linear  or  nonlinear  projectors 
onto  the  spaces  {<Sm}  (see  §2.3  and  §2.4).  This  is  our  way  of  representing  the  functions. 

Our  approximation  scheme  is  the  following: 

(i)  For  a  given  function  /,  find  the  “right”  B-space  Ba(Tf)  (that  means  the  “right” 
triangulation  7})  in  which  /  exhibits  the  highest  smoothness  (equivalently,  in  which  /  has 
the  sparsest  representation). 

(ii)  Find  an  optimal  (or  near  optimal)  representation  of  /  by  Courant  elements  (or  piece- 
wise  polynomials)  generated  by  7/. 

(iii)  Using  this  representation  of  /,  run  an  algorithm  for  n-term  approximation  that  is 
capable  of  achieving  the  rate  of  the  best  n-term  approximation. 

The  first  step  in  this  scheme  is  the  hardest  one  and  we  still  do  not  have  a  satisfac¬ 
tory  algorithm  for  it.  There  is,  however,  an  effective  scalable  algorithm  for  this  step  in  the 
case  of  nonlinear  approximation  from  piecewise  polynomials  over  dyadic  partitions,  see  [P2], 
Once  the  triangulation  T  is  determined,  the  machinery  of  Jackson  and  Bernstein  estimates 
combined  with  interpolation  spaces  works  perfectly  well.  As  we  advance  through  the  imple- 
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mentation  of  the  above  program,  we  shall  see  that  all  technological  means  exist  or  can  be 
created  so  that  a  coherent  theory  can  be  developed.  The  lack  of  good  bases  for  our  spaces 
is  the  main  obstacle  that  makes  some  proofs  nonstandard.  In  particular,  the  Bernstein  in¬ 
equalities  are  the  most  troublesome  and  require  fine  analysis.  We  borrowed  a  few  ideas 
from  [P2],  where  similar  results  have  been  obtained  in  the  much  simpler  setting  of  nonlinear 
approximation  from  piecewise  polynomials  over  dyadic  boxes. 

The  B-spaces  from  this  article  can  be  considered  as  a  generalization  of  Besov  spaces  (see 
§2.5  below).  They  are  also  a  generalization  of  the  approximation  spaces  from  §3.4  in  [O]  (see 
the  references  therein). 

There  are  several  aspects  of  our  theory  that  we  do  not  even  touch  in  this  article,  including 
nonlinear  piecewise  polynomial  approximation  in  the  uniform  norm  (p  =  oo),  interpolation  of 
B-spaces  and  other  aspects  of  the  harmonic  analysis  of  B-spaces,  n-term  approximation  from 
smooth  piecewise  polynomials,  and  numerical  algorithms  for  nonlinear  piecewise  polynomial 
approximation  and  their  implementation  in  practice.  Some  of  them  will  be  tackled  in  a 
forthcoming  article. 

The  outline  of  the  paper  is  the  following.  Section  2  is  devoted  to  the  definition  and  de¬ 
velopment  of  B-spaces.  In  §2.1,  we  introduce  and  study  three  types  of  nested  triangulations 
of  R2,  which  later  serve  three  different  purposes.  In  §2.2,  we  give  all  necessary  facts  about 
local  polynomial  and  piecewise  linear  approximation.  In  §2.3,  we  introduce  and  develop 
the  first  family  of  B-spaces,  the  slim  B-spaces,  which  are  later  utilized  for  nonlinear  n-term 
Courant  element  approximation.  In  §2.4,  we  introduce  the  skinny  B-spaces  that  are  needed 
for  nonlinear  n-term  approximation  from  (discontinuous)  piecewise  polynomial.  In  §2.5,  we 
introduce  the  fat  B-spaces  which  are  the  most  immediate  generalization  of  Besov  spaces. 
Section  3  contains  our  main  results  about  nonlinear  piecewise  polynomial  approximation.  In 
§3.1,  we  give  some  general  guiding  principles  and  results  for  nonlinear  n-term  approximation. 
In  §3.2,  we  state  and  prove  our  main  results  concerning  n-term  Courant  element  approxima¬ 
tion  except  for  the  proof  of  the  Bernstein  inequality.  In  §3.3,  we  give  our  results  on  n-term 
piecewise  polynomial  approximation.  Subsection  3.4  is  devoted  to  discussion  of  some  aspects 
of  our  theory  and  open  problems.  Section  4  is  an  appendix.  In  §4.1,  we  prove  the  Bernstein 
estimates  we  need.  Subsection  4.2  contains  the  proofs  of  some  auxiliary  results. 

Throughout  the  article,  the  constants  are  denoted  by  c,  c.\. . . .  and  they  may  vary  at 
every  occurrence.  The  constants  usually  depend  on  some  parameters  that  will  be  sometimes 
indicated  explicitly.  The  notation  A  B  means  that  A  and  B  are  equivalent,  i.e.,  there  are 
two  constants  ci,C2  >  0  such  that  ci^4  <  B  <  C2A.  For  G  C  R2,  |G|  denotes  the  Lebesgue 
measure  of  G  and  tc  denotes  the  characteristic  (indicator)  function  of  G.  We  also  use  the 
following  notation:  ||  •  ||?  :=  ||  •  ||i<;g|2),  Ll°c  :=  Ll°c(R2)  (0  <  q  <  00),  and  L1^  :=  C(R2). 

2  B-spaces  over  triangulations 

In  this  section,  we  introduce  and  explore  three  collections  of  smoothness  spaces  (B-spaces), 
which  will  be  needed  in  §3-4  for  the  characterization  of  the  rates  of  nonlinear  piecewise 
polynomial  approximation.  The  B-spaces  can  be  defined  on  R2  or  on  any  polygonal  domain 
in  R2  as  well  as  in  Rd  (d  ^  2).  We  shall  restrict  our  attention  to  the  case  of  B-spaces  on  R2. 
The  B-spaces  are  defined  using  multilevel  nested  triangulations  which  we  discuss  below. 
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2.1  Multilevel  triangulations 

Here  we  introduce  several  types  of  multilevel  nested  triangulations. 

Weak  locally  regular  (WLR)  triangulations.  We  call  T  =  Umez  Tm  a  weak  locally 
regular  (WLR)  triangulation  of  R2  with  levels  { Tm  }  met  if  the  following  conditions  are  fulfilled: 

(a)  Every  level  Tm  defines  a  partition  of  R2,  that  is,  R2  =  Uag7;7!  ^  and  Tm  consists  of 
closed  triangles  with  disjoint  interiors. 

(b)  The  levels  {Tm}mez  of  T  are  nested,  i.e. ,  Tm+i  is  a  refinement  of  Tm. 

(c)  Each  triangle  A  G  Tm  ( rn  G  Z)  has  at  least  two  and  at  most  M0  children  (subtriangles) 
in  Tm+i,  where  M0  >  2  is  a  constant. 

(d)  For  any  compact  K  C  R2  and  any  fixed  m  G  Z,  there  is  a  finite  collection  of  triangles 
from  Tm  which  covers  K. 

(e)  There  exist  constants  0  <  r  <  p  <  1  (r  <  |)  such  that  for  each  A  G  Tm  ( rn  G  Z)  and 
any  child  A'  G  Tm+i  of  A 

r | A |  <  |A'|  <  p|A|.  (2.1) 

We  denote  by  Vm  and  Em  the  sets  of  all  vertices  and  edges  of  triangles  in  Tm ,  respectively. 
We  also  set  V  :=  V(T)  :=  \JmeZ  Vm  and  E  :=  E(T)  :=  \JmeM  E™- 

Locally  regular  (LR)  triangulations.  We  call  T  =  Umez  Tm  a  locally  regular  (LR) 
triangulation  of  R2  if  T  is  a  WLR-triangulation  of  R2  and  satisfies  the  following  additional 
conditions: 

(f)  No  hanging  vertices  ( NHV)  condition :  No  vertex  of  any  triangle  A  G  Tm  lies  in  the 
interior  of  an  edge  of  another  triangle  from  Tm. 

(g)  The  valence  Nv  of  each  vertex  v  of  any  triangle  A  G  Tm  (the  number  of  the  triangles 
from  Tm,  which  share  v  as  a  vertex)  is  at  most  Ay.  where  N0  is  a  constant. 

(h)  There  exists  a  constant  0  <  8  <  1  independent  of  m  such  that  for  any  A',  A"  G  Tm 
(rn  G  Z)  with  a  common  edge 

S  <  IA'1/IA"!  <  r1.  (2.2) 

For  v  G  Vm  (rn  G  Z),  we  denote  by  9V  :=  9V(T )  the  cell  associated  with  v,  i.e.,  9V  is  the 
union  of  all  triangles  A  G  Tm  which  have  v  as  a  common  vertex.  We  denote  by  0m  :=  Qm(T ) 
the  set  of  all  cells  generated  by  Tm  and  set  0  :=  0(T)  :=  Umez  @m- 

Strong  locally  regular  (SLR)  triangulations.  We  call  T  =  Umez  Tm  a  strong  locally 
regular  (SLR)  triangulation  of  R2  if  T  is  an  LR-triangulation  of  R2  and  satisfies  the  following 
additional  condition: 

(i)  Affine  transform  angle  condition  (AT A- condition):  There  exists  a  constant  [3  =  /3(T), 
0  <  / 3  <  7t/3,  such  that  if  A0  G  Tm.  rn  G  Z,  and  A  :  R2  — )■  R2  is  an  affine  transform  that 
maps  A0  one-to-one  onto  an  equilateral  reference  triangle,  then  for  every  A  G  Tm  which  has 
at  least  one  common  vertex  with  A0  and  for  every  child  A  G  Tm.+i  of  A0,  we  have 

minangle  (A(A))  >  (3,  (2.3) 

where  A(A)  is  the  image  of  A  by  the  affine  transform  A,  and  min  angle  (A7)  denotes  the 
magnitude  of  the  minimal  angle  of  A'. 

Obviously,  (i)  implies  (2.2)  with  some  S  =  d(/3). 
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Regular  (R)  triangulations.  By  definition,  T  =  {JmezTm.  is  a  regular  (R)  triangulation 
if  T  is  an  LR-triangulation  and  T  satisfies  the  following  condition: 

(j)  There  exists  a  constant  f3  =  fi(T)  >  0  such  that  the  minimal  angle  of  each  triangle 
A  G  T  is  >  / 3 . 

Evidently,  every  regular  triangulation  is  an  SLR-triangulation. 

Triangulations  on  compact  polygonal  domains  in  R2.  A  set  E  C  R2  is  said  to  be  a 
compact  polygonal  domain  if  E  can  be  represented  as  the  union  of  a  finite  set  7o  of  closed 
triangles  with  disjoint  interiors:  E  =  (Jag7B  A.  Weak  locally  regular,  locally  regular,  etc., 
triangulations  T  =  Um>o  7A  °f  such  domain  E  Cl2  are  defined  similarly  as  when  E  =  R2 . 
The  only  essential  distinctions  are  that  the  levels  {Tn}m>o  now  are  consecutive  refinements 
of  an  initial  (coarse)  level  %  and,  if  a  vertex  v  G  Vm  is  on  the  boundary,  we  should  include 
in  Vm  as  many  copies  of  v  as  its  multiplicity. 

Remarks.  It  is  a  key  observation  that  the  collection  of  all  SLR-triangulations  with  given 
(fixed)  parameters  is  invariant  under  affine  transforms.  The  same  is  true  for  similar  classes 
of  LR-triangulations  or  WLR-triangulations. 

Each  type  of  triangulation  depends  on  several  parameters  which  are  not  completely  in¬ 
dependent.  For  instance,  the  parameters  of  an  LR-triangulation  are  M0,  iV0,r,  p,  and  S.  We 
could  set,  e.g.,  M0  =  7  and  p  =  1  —  r,  and  eliminate  these  as  parameters,  but  this  would 
tend  to  obscure  the  actual  dependence  of  the  estimates  upon  given  triangulations. 

We  shall  need  to  know  what  happens  with  the  levels  Tm  of  a  triangulation  T  as  m  — >■  — oo. 
The  next  lemma  answers  this  question. 

Lemma  2.1.  For  each  WLR-triangulation  T  there  exists  a  finite  cover  T-oo  o/R2  consisting 
of  sets  with  disjoint  interiors  such  that  each  triangle  A  G  T  and  all  its  ancestors  are  contained 
in  a  set  A ^  G  T-oo ■  //A  ^  G  T-oo,  then  A ^  must  be  one  of  the  following:  the  all  o/R2,  a 
half-plane,  or  an  infinite  triangle  ( all  points  on  and  between  two  rays  that  are  not  collinear 
and  have  a  common  beginning ).  The  only  possible  configurations  for  T-oo  are  the  following: 

(a)  R2  only; 

(b)  finitely  many  infinite  triangles  with  a  common  vertex; 

(c)  two  half-planes; 

(d)  a  half-plane  and  finitely  many  infinite  triangles  which  cover  the  other  half-plane  and 
have  a  common  vertex  lying  on  the  boundary  betiueen  the  two  half-planes; 

(e)  two  finite  families  of  infinite  triangles,  each  family  covering  one  of  two  complimentary 
half-planes,  and  such  that  all  triangles  from  the  same  family  have  a  common  vertex  lying  on 
the  boundary  between  the  two  half-planes. 

Moreover,  if  T  satisfi.es  the  NHV- condition,  then  (a)  and  (b)  are  the  only  possible  con¬ 
figurations  for  T-oo- 

Proof.  Let  A  G  Tm  for  some  m  G  Z.  Then  there  exist  unique  triangles  {A,};.  A  j  G  7j, 
such  that  A  =:  Am  C  Am_i  C  •  •  •.  We  let  Aoo  :=  \J)<m  A,-.  Clearly,  if  A',  A"  G  T  then 
either  A'^  =  A'fi  or  A^  and  A'^  have  disjoint  interiors.  To  find  out  which  subsets  of  R2  can 
be  realized  as  A^,  we  order  the  vertices  of  the  triangles  { A  ;  }  in  a  sequence  {r^}.  If  { v &} 
does  not  have  limit  points  we  consider  two  cases.  First,  if  for  every  A  j  there  exists  i  <  j 
such  that  Aj  C  A°,  then  using  condition  (d)  from  the  definition  of  WLR-triangulations  one 
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can  easily  see  that  Aoo  is  all  of  R2 .  Alternatively,  if  there  exists  a  AJ0  which  is  not  contained 
in  A j  for  any  j  <  jo,  then  each  Aj,  j  <  j0,  has  an  edge  lying  on  a  given  line  l.  Since  {?y } 
does  not  have  limit  points,  those  edges  grow  infinitely  in  both  directions,  and  therefore  the 
whole  line  l  must  be  contained  in  A^.  Hence,  since  A^  is  always  convex,  it  must  be  either 
a  half-plane  or  a  strip.  Using  that  {v^}  does  not  have  limit  points  and  condition  (d),  one 
can  prove  that  sup^^  dist(:r,  l )  =  oo,  which  shows  that  A^  cannot  be  a  strip. 

If  the  sequence  {w*.}  has  a  limit  point,  sav  Xo,  then  using  condition  (d)  we  obtain  that 
there  exists  jo  <  m  such  that  xo  is  a  vertex  of  all  A  j  with  j  <  j0.  From  condition  (d),  it 
follows  that  a  vertex  can  have  only  finite  valence  at  any  given  level.  This  fact  readily  implies 
that  {vk}  cannot  have  more  than  one  limit  point  and  also  that  if  {«&}  has  exactly  one  limit 
point  then  A^  is  an  infinite  triangle. 

Simple  arguments  utilizing  condition  (d)  limit  the  possible  configurations  for  T-oo  to 
those  described  in  the  lemma.  There  are  straightforward  examples  showing  that  each  of 
those  configurations  can  be  realized.  □ 

Examples  of  triangulations  and  refinement  schemes.  We  begin  with  the  description 
of  a  standard  refinement  scheme  that  can  be  used  to  refine  a  given  triangle  A  infinitely  many 
times.  In  the  first  step,  we  select  a  point  on  each  edge  of  A  and  then  join  each  pair  of  new 
points  by  a  line  segment.  This  first  step  gives  us  four  disjoint  triangles,  say,  Ai,  A2,  A3,  A4 
which  become  the  first  generation  of  triangles  (the  children  of  A).  In  the  second  step,  we 
subdivide  each  A  j  in  the  way  described  in  step  one  and  obtain  the  second  generation  of 
triangles.  Proceeding  inductively,  we  subdivide  each  triangle  from  a  given  generation  in 
the  fashion  of  step  one,  thus  producing  the  next  generation  of  triangles.  Let  Tm{A)  denote 
the  set  of  all  triangles  from  the  m-tli  generation.  Then  T(A)  :=  Um=07m(A)  a  nested 
triangulation  of  A. 

Now,  we  describe  a  standard  procedure  for  constructing  triangulations  of  R2.  We  first 
cover  R2  by  a  sequence  of  growing  triangles  A0  C  A+  C  A2  C  . . .,  where  every  A  j  is  a 
child  of  Aj+1,  and  then  refine  all  children  of  all  { A  y }  using  the  standard  refinement  scheme 
described  above.  More  precisely,  let  A0  be  any  initial  triangle.  We  select  a  triangle  Ai  so 
that  A0  is  a  child  of  Ai.  We  similarly  define  A2  D  Ai  so  that  Ai  is  a  child  of  A2,  etc.  In 
this  way  we  obtain  a  growing  sequence  of  triangles.  The  only  additional  condition  that  we 
impose  on  {Aj},  so  far,  is  that  R2  =  |J  '  ,,  A,.  After  having  constructed  the  sequence  {Aj}, 
we  subdivide  the  children  of  each  A  j  (j  =  1,2,...)  as  it  was  described  above.  We  denote  by 
{' Tm}m&  the  sets  °f  triangles  from  each  level  and  by  T  :=  (JmeZ  Tm  the  whole  triangulation 
of  R2.  Variety  of  other  refinement  schemes  can  be  utilized. 

How  fast  can  the  elements  of  triangles  change?  We  investigate  how  the  elements  (|A'|, 
min  angle  (A7),  and  max^(A'),  the  longest  edge  of  A')  of  a  triangle  A'  G  Tm,  (m  G  Z)  can 
change  as  A7  moves  away  from  a  fixed  triangle  A"  G  Tm ,  for  different  types  of  triangulations 

T. 

First,  we  consider  the  case  of  an  arbitrary  weak  locally  regular  triangulation  T- 
Clearly,  even  if  T  satisfies  the  NHV-condition  of  the  LR-triangulations,  it  may  happen  that 
A7,  A77  G  Tm  (m  G  Z)  are  two  adjacent  triangles  and  at  the  same  time  each  of  the  ratios 
|A"J,  max e(A"]  ’  an<^  mhaii^e(A")  arbitrarily  large  (or  small)  independently  of  the  other  two. 
This  is  possible  because  the  first  common  ancestor  of  A7  and  A77  may  be  at  an  extremely 
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distant  level,  or  even  A7  and  A"  may  not  have  a  common  ancestor  at  all  (see  Lemma  2.1). 
This  fact  makes  the  WLR-triangulations  unsuitable  for  continuous  piecewise  polynomial 
approximation. 


Secondly,  we  consider  the  case  of  an  arbitrary  locally  regular  triangulation  T-  Bv 
definition  (see  (2.2)),  if  A7,  A"  £  Tm  and  A7  and  A77  have  a  common  edge,  then  |  A7 1  m  |  A77 1 . 
However,  it  may  happen  that  the  ratios  and  are  uncontrollably  large 

(or  small),  see  Figure  1.  To  show  that  this  situation  is  possible  we  shall  need  the  following 
simple  lemma. 

Lemma  2.2.  Let  IJJU-oo  ^m,  n  £  7h,  satisfy  the  conditions  of  the  WLR-triangulations  or  LR- 
triangulations  or  SLR-triangulations  with  some  fixed  parameters.  Assume  also  that  level  Tn 
is  refined  uniformly  by  introducing  the  midpoints  on  the  edges  of  each  A  £  Tn  and  connecting 
them  by  line  segments  ( see  the  standard  refinement  scheme  described  above).  Denote  by  Tn+i 
the  set  of  all  triangles  obtained  from  the  refinement  of  Tn-  Then  Tm  satisfies  the 

conditions  of  the  corresponding  type  of  triangulation  with  exactly  the  same  parameters. 

Proof.  This  lemma  is  fairly  obvious  and  its  proof  will  be  omitted.  □ 

Armed  with  this  lemma,  one  can  easily  construct  the  claimed  example.  We  shall  give  only 
a  sketch  of  it.  We  start  from  a  uniform  triangulation  T  of  M2  generated  by  an  equilateral 
triangle  A0  (see  the  examples  of  triangulations  above).  Let  Tm,  denote  the  levels  of  T  for 
m  <  0.  The  incomplete  triangulation  Um=_oo  Tm,  obviously  satisfies  the  conditions  of  the 
LR-triangulations  with  any  parameters  t)<r<p<l,r<\,  p>\.  We  fix  such  r  and  p. 
We  now  refine  %■  We  choose  any  two  triangles  A7,  A77  £  7o  with  a  common  edge,  say  e.  We 
may  assume  that  e  is  horizontal.  It  is  not  very  hard  to  see  (but  it  is  not  obvious)  that  7o  can 
be  refined  twice  so  that  U^-oo  Tm  satisfies  the  conditions  of  the  LR-triangulations  with  the 
already  selected  parameters  r  and  p,  and  that  there  are  two  grandchildren,  say,  A72  and  A"  of 
A7  and  A77,  respectively,  with  the  following  properties:  (a)  A72  and  A"  have  a  common  edge, 
say,  62  C  e  of  length  t(e2)  =  ^T(e);  (b)  |A2|  =  |A2|  =  ^|A7|  (=  A"  );  (c)  A2  is  equilateral 
and  A"  is  skewed  to  the  right  (or  left)  at  £  •  i{e2)  with  e  =  s(r,  p)  >  0.  More  precisely,  the 
vertex  of  A",  which  does  not  belong  to  e2,  is  shifted  to  the  right  from  the  midpoint  of  e2  at 
distance  e  ■  b(e2).  We  shall  call  the  above  an  angle  sharpening  procedure.  We  next  refine  T2 
sufficiently  many  times,  by  using  only  midpoints,  until  we  reach  a  level,  say,  TSl  at  which 
there  exist  two  great-grandchildren,  say,  A7Si  and  A'fi  of  A2  and  A",  respectively,  such  that 
A7Si  and  A"  have  a  common  edge,  |  A7Si  |  =  A"  |,  A7Si  is  equilateral,  A"  is  similar  to  A",  and 
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most  importantly  the  minimal  number  of  edges  from  VSl  (edges  of  triangles  in  TSl)  which 
connect  an  arbitrary  vertex  of  A'  or  A"  with  any  vertex  of  A'2  or  A"  is  sufficiently  large 
(so,  A'  and  A"  are  located  in  the  middle  of  A2  U  A").  By  Lemma  2.2,  1J^__00  Tm  satisfies 
the  conditions  of  the  LR-triangulations  with  the  already  fixed  parameters  p  and  r.  Since,  in 
TSl,  A'  and  A"  are  surrounded  by  triangles  that  are  equivalent  to  A'  or  A"  ,  we  can  again 
apply  our  angle  sharpening  procedure,  followed  by  sufficiently  many  midpoint  refinements, 
and  keep  going  on  in  the  same  fashion.  We  use  induction  to  complete  the  construction  of 
the  claimed  example. 

Let  us  consider  now  an  arbitrary  strong  locally  regular  triangulation  T.  From  the 
definition  of  SLR-triangulations,  it  follows  that  if  A',  A"  G  Tm ,  m  G  Z,  and  A'  and  A" 
have  a  common  vertex,  then  |A'|  A"|.  min  angle  (A')  min  angle  (A"),  and  max  ('(A1) 

max  l(A").  However,  this  does  not  mean  that  T  is  regular  or  close  to  regular.  It  may  happen 
that  some  triangles  of  T  have  arbitrarily  small  angles,  while  others  are  equilateral. 
Example  of  an  SLR-triangulation  T  with  the  property 

inf  min  angle  (A)  =  0. 

AeT 

We  shall  utilize  the  idea  of  the  construction  from  the  previous  example.  As  above,  we  assume 
that  T  =  Um=— oo  is  an  incomplete  uniform  triangulation  generated  by  an  equilateral 
triangle  A0.  Clearly,  T  satisfies  the  conditions  of  the  SLR-triangulations  for  M0  =  4  and 
an  arbitrary  0  <  0  <  7t/3.  We  fix  0  and  M0.  Choose  A  G  %.  It  is  readily  seen  that  % 
can  be  refined  so  that  (J)L))=_00  Tm  satisfies  the  conditions  of  the  SLR-triangulations  with  the 
fixed  parameters  0  and  M0,  and  there  exists  at  least  one  child,  say,  Ai  G  71  of  A  such  that 
min  angle  (Ai)  <  q  ■  min  angle  (A)  with  q  =  q(ji)  <  1.  The  next  step  is  to  refine  T\  several 
times  by  using  only  midpoints  until  we  obtain  a  great-grandchild,  say,  ASl  G  TSl  of  Ai  which 
is  sufficiently  far  from  the  boundary  of  Ax  (in  terms  of  number  of  edges  from  VSl  needed 
to  connect  it  with  the  boundary).  By  Lemma  2.2,  |J^=00  Tm  satisfies  the  conditions  of  the 
SLR-triangulation  with  the  fixed  parameters  / 3  and  M0.  After  that,  we  apply  the  above  angle 
sharpening  procedure  to  ASl  and  then  we  again  refine  by  midpoints  for  sufficiently  many 
levels,  etc.  Inductively,  we  obtain  the  needed  triangulation. 

We  now  introduce  one  more  natural  condition  on  triangulations: 

Minimal  angle  condition  (M A- condition):  There  exists  a  constant  0  =  d(T),  0  <  0  <  1, 
such  that  if  A0  G  Tm ,  m  G  Z,  then  for  every  A  G  Tm  which  has  at  least  one  common  vertex 
with  A0  and  for  every  A  G  Tm+i  which  is  a  child  of  A0, 

,  min  angle  (A)  .  . 

0  <  — - ,  \  <  W1.  (2.4) 

mm  angle  (A0) 

Lemma  2.3.  If  T  is  an  SLR-triangulation,  then  T  satisfies  the  MA-condition  above  with 
0  =  0(0).  However,  the  MA-condition  is  weaker  than  the  ATA- condition. 

Proof.  Suppose  T  is  an  SLR-triangulation  and  let  A0  G  Tm ,  m  G  Z.  We  may  assume  that 
the  largest  edge  of  A0  is  of  length  one.  We  introduce  a  coordinate  system  Ox ix2  so  that 
the  origin  O  is  at  the  vertex  of  the  sharpest  angle  of  A0  and  the  largest  edge  of  A0  lies  on 
the  positive  half  of  the  aq-axis.  Without  loss  of  generality,  we  can  assume  that  A0  is  in 
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the  upper  right  quadrant  of  Ox  \x2-  We  select  the  equilateral  reference  triangle  A'0  to  be  in 
the  upper  right  quadrant  of  Ox  1X2  and  have  one  edge  coinciding  with  the  longest  edge  of 
A0.  Evidently,  both  the  affine  (linear  in  this  case)  transform  A  which  maps  A0  one-to-one 

onto  A'0  and  its  inverse  A-1  have  matrices  of  the  form 

of  A'0  with  vertex  at  the  origin  and  magnitude  of  7t/3  is  transformed  by  A-1  into  an  angle 
of  magnitude  7,  0  <  7  <  7r/3.  In  this  setting,  routine  (but  not  trivial)  calculations  show 
that  A-1  transforms  any  angle  of  magnitude  >  [3  into  an  angle  of  magnitude  >  07.  where 
c  =  c(/3)  is  a  positive  constant.  We  skip  all  details  and  only  note  that  it  suffices  to  prove 
the  above  fact  only  for  angles  with  vertex  at  the  origin  because  the  affine  transforms  map 
parallel  lines  into  parallel  lines.  This  result  implies  that  T  satisfies  the  MA-condition. 

The  MA-condition  does  not  imply  the  ATA-condition  because  the  following  configuration 
of  triangles  is  possible:  Let  Ai  :=  [(0,  0) , ( 1 ,  0),(e/2,  eV 3/2)],  where  e  >  0  is  sufficiently  small. 
Denote  by  A2  the  triangle  symmetric  to  A\  with  respect  to  the  aq-axis.  Further,  let  A3  and 
A4  be  the  images  of  A4  and  A2  after  rotation  of  — 2tt/3  about  the  origin,  and  let  A5,  and 
Ag  be  the  images  of  A|  and  A2  after  rotation  of  2w/3  about  the  origin.  A  triangulation 
containing  this  kind  of  configuration  on  one  level  can  be  constructed  for  an  arbitrary  small  s 
by  starting  from  some  level  of  a  uniform  triangulation  consisting  of  equilateral  triangles  and 
“sharpening”  the  angles  near  a  given  node  in  three  equiangular  directions  while  refining  the 
rest  of  the  triangulations  uniformly,  as  in  the  previous  example.  Obviously,  this  configuration 
does  not  violate  the  MA-condition  but  due  to  the  presence  of  sharp  angles  in  different 
directions  the  ATA-condition  fails.  □ 

Our  next  theorem  provides  estimates  for  the  rate  of  change  of  the  elements  of  triangles 
from  a  given  level  of  a  triangulation  when  moving  away  from  a  fixed  triangle.  For  these 
estimates,  we  need  the  following  simple  lemma. 

Lemma  2.4.  Suppose  T  is  an  LR-triangulation.  If  A',  A"  G  Tm,  m  G  7L,  and  A7  and  A" 
can  be  connected  by  <  T  intermediate  triangles  ( with  common  vertices )  from  Tm,  then  there 
exist  Ai,  A2  G  Tm-2N0u  with  a  common  vertex  such  that  A7  C  A4  and  A"  C  A2j  where  N0  is 
from  condition  (g)  of  LR-triangulations. 

Proof.  From  conditions  (c)  and  (g)  on  LR-triangualtions  (§2.1),  it  follows  that  every  edge 
of  a  triangle  from  Tm  is  subdivided  at  least  once  after  2 N0  steps  of  refinement.  From  this,  we 
infer  that  if  G  C  K2,  then  Qm(Qm(G))  C  nm~2No{G),  where  Ql{G)  :=  U {6  G  0*  :  9°nG  ±  0} 
(9°  denotes  the  interior  of  9).  Applying  this  fact  v  times,  we  obtain  that  A"  C  D'A  2  v°l/(  {/•}). 
where  v  is  an  appropriate  vertex  of  A7.  Then  the  existence  of  A ,  and  A2  follows  readily. 
□ 

Theorem  2.5.  (a)  Let  T  be  an  LR-triangulation  with  parameters  0  <  r  <  p  <  1  and  N0.  If 
A7,  A77  G  Tm,  m  G  7L,  and  A'  and  A"  can  be  connected  by  n  (n  >  1)  intermediate  triangles 
from  Tm,,  then 

c^n-3  <  |^jj-  <  cins  (2.5) 

with  s  :=  2iV0log2((  and  c4  :=  S~N°(^)2N 


1  Mi 

0  u2 


.  Suppose  that  the  angle 
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(b)  Let  T  be  an  SLR-triangulation  with  parameter  0  <  [3  <  7r/3.  If  A7,  A77  G  Tm,  m  G  Z, 
and  A'  and  A"  can  6e  connected  by  n  (n  >  1)  intermediate  triangles  from  Tm,  then 


min  angle  ( A' ) 

- ,  /  A  ;  <  c2n ‘ 

min  angle  (A") 


(2.6) 


with  t.  :=  4A0log2|  and  c2  :=  $  4A°  1,  where  N0  :  =  [^f\  and  D  =  d(/4)  zs  the  constant  from 
the  MA-condition  whose  existence  is  established  by  Lemma  2.3. 


Proof,  (a)  Let  u  G  Z  be  such  that  2"~ 1  <n<  T .  By  Lemma  2.4,  there  exist  A!,A2  G 
Tm~2N0v  with  a  common  vertex  such  that  A7  C  A,  and  A"  C  A2.  By  (2.2),  5N°  <  |  Ax  | / 1 A2 1  < 
d_A'°,  and  by  (2.1),  it  follows  that  |A7|  <  p2N°u | Ax |  and  |A"|  >  r2N°v |A2|.  Combining  the 
above  estimates,  we  obtain  (2.5). 

(b)  The  proof  of  (2.6)  is  quite  similar  to  the  proof  of  (2.5)  and  uses  Lemma  2.3.  We  omit 
it.  □ 


2.2  Local  polynomial  and  piecewise  linear  approximation 

We  let  Il/t  denote  the  set  of  all  algebraic  polynomials  in  two  variables  of  total  degree  <  k. 
For  a  function  /  G  Lq(G ),  Gcl2,0<g<oo,  and  k  >  1,  we  denote  by  Ek(f,  G)q  the  error 
of  Lq ((^-approximation  to  /  from  If/.,  i.e., 

E„U,G),:=  inf  ||/-P||W0).  (2,7) 

Also,  we  denote  by  LOk{f,G)q  the  A>th  modulus  of  smoothness  of  /  on  G: 

uk(f,  G)q  ■■=  sup  ||A*(/,-)||Mo)»  (2-8) 

hen2 

where  A \{f,x)  =  A \{f,x,G)  :=  ^^=0(  —  l)k+:>  (kj)f{x  +  jh)  if  the  line  segment  [x,x  +  kh]  is 
enirely  contained  in  G  and  A*(f,x)  :=  0  otherwise. 

For  an  LR-triangulation  T  and  A  G  Tm  ( rn  G  Z),  we  denote  by  Qa  the  union  of  all 
triangles  A7  G  Tm  which  have  a  common  vertex  with  A,  i.e., 

fiA  :=  U{A7  G  Tm  :  A7  n  A  ^  0}.  (2.9) 

Also,  we  define 

nl  :=  U{A7  GTro:A'n  QA  ±  0}.  (2.10) 

Lemma  2.6.  [Whitney]  Suppose  G  :=  A  or  G  :=  for  some  triangle  A  G  Tm  ( rn  G  Z); 
where  T  is  an  SLR-triangulation  o/M2.  If  f  G  Lq(G),  0  <  q  <  oo,  and  k  >  1,  then 

Ek{f,G)q  <  cuk(f,G)q  (2.11) 

with  c  =  c(q ,  k)  or  c  =  c(q,  k ,  j3),  where  (3  is  the  parameter  of  T  from  (2.3). 

For  the  proof  of  this  lemma,  see  the  appendix  (§4.2). 

We  shall  often  use  the  following  lemma,  which  establishes  relations  between  different 
norms  of  polynomials  over  different  sets. 
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Lemma  2.7.  Let  P  E  LR;  k  >  l,  and  0  <  p,q  <  oo. 

(a)  Let  A'  C  A  be  two  triangles  such  that  |A|  <  ci|A'|.  Then 

ll^lkp(A)  <  c||P||tp(A.) 


(2.12) 


with  c  =  c(p ,  A’,  ci). 

(b)  Suppose  A'  C  A  are  two  triangles  such  that  |A'|  <  p|A|  with  0  <  p  <  1  or  A'  =  0. 
Then 

II-PIIma)  -  cII-P|U„(a\a')  ~  |A|1//p_1//<?||P||L(!(A\A')  (2.13) 

with  constants  depending  only  on  p,  q,  k,  and  p. 

(c)  IfT  is  an  LR-triangulation  and  A  E  T ,  then 

l|P|k<nA)  *  (2.14) 

with  constants  of  equivalence  depending  only  on  p,  q,  k,  N0,  and  S. 

(d)  If  P  EU2  and  A  =  [xi,x2,xf\  C  M2  is  a  triangle,  then 

llplk(A)  ~  |A|1/9  max  |P(a:i)|,  (2.15) 

with  constants  of  equivalence  depending  only  on  q. 

Proof.  Estimates  (2.12)-(2.15)  are  invariant  under  affine  transforms  and  hence  they  follow 
from  the  case  when  A  is  an  equilateral  triangle  with  |A|  =  1  by  change  of  variables.  The 
details  will  be  omitted.  □ 

We  find  useful  the  concept  of  near  best  approximation  which  we  borrowed  from  [DP], 
A  polynomial  Pa  €  fifi  is  said  to  be  a  near  best  Lg(A)-approximation  to  /  from  Ifo  with 
constant  A  if 

\\f  ~  PaWl^a)  <  AEk(f,A)q.  (2.16) 

Note  that  if  q  >  1,  then  a  near  best  L?(A)-approximation  PA  =  Pa(/)  can  be  easily  realized 

by  a  linear  projector. 

Lemma  2.8.  Suppose  0  <  q  <  p  and  PA  is  a  near  best  Lq(A)- approximation  to  f  E  Lp( A) 
from  Hk.  Then  PA  is  a  near  best  Lp( A) -approximation  to  f . 

Proof.  See  Lemma  3.2  from  [DP]  and  also  the  proof  of  Lemma  2.12  in  the  appendix  (§4.2). 

□ 


We  next  introduce  some  necessary  notation.  Let  T  =  \Jmez  '3e  a  WLR-triangulation. 
For  m  E  Z  and  k  >  1,  we  let  S ^  :=  Sk(Tm )  denote  the  set  of  all  piecewise  polynomial 
functions  over  Tm  of  degree  <  k ,  i.e.,  S  E  S ^  if  S  =  J^AeTm  ere  1a  is  the 

characteristic  function  of  A  and  PA  E  LR. 

Now,  let  T  =  LLez  Tm  be  an  LR-triangulation.  For  v  E  Vm  (m  E  Z),  we  let  6V  denote 
the  cell  in  Tm  associated  with  v  (§2.1).  The  NHV-condition  on  LR-triangulations  (§2.1) 
guarantees  the  existence  of  a  Courant  element  pgv  supported  on  9V  which  is  a  continuous 
piecewise  linear  function  that  takes  the  value  one  at  v. 
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For  rn  G  Z,  we  denote  by  Sm  :=  S(Tm )  the  set  of  all  continuous  piecewise  linear  functions 
over  Tm,  i,e.,  Sm  =  <S2ra  n  C(R2).  From  the  NHV-condition  on  T,  each  S  G  Sm  has  the 
representation:  S  =  ^2veVm  S(v)T0v  and  hence  Sm  =  spanj^g  :  6  G  0m}. 

Throughout  the  rest  of  this  section,  we  assume  that  T  is  an  LR-triangulation  of  R2  with 
parameters  M0,  N0,  r,  p,  and  5  (see  §2.1). 

Lemma  2.9.  Suppose  {cig}geemf  111  G  Z ,  is  a  sequence  of  real  numbers  and  S  :=  agpg. 

Let  also  0  <  q  <  oo.  Then,  for  every  A  G  Tm,  we  have 

l|S|k(A)  ~  (  £  ll«»V,»ll|)1/*  (2.i7) 

0e&m:  AC0 

and,  hence, 

ll^lli^K2)  ~  (  (2.18) 

oe@m 

with  constants  of  equivalence  depending  only  on  q,  N0,  and  5.  In  the  case  q  =  oo,  the  Iq-norm 
above  is  replaced  by  the  sup -norm. 

Proof.  Clearly,  S(vg)  =  ag  (vg  is  the  “central  point”  of  9)  and  \\pg\\q  ~  \9\^q.  Therefore, 
using  Lemma  2.7,  (d)  and  the  regularity  of  T,  we  have,  for  A  G  Tmf 

IIS'IIl  (a)  ~  I A 1 1//,?  max  \ag\  ~  max  \ag\ \9\l^q 
13  (9e0m:AC  0  0eem:AC0 

~  (  \\a9To\\ i) 1  ■  □ 

0eQm:  Ace 


Quasi-interpolant.  We  shall  utilize  the  following  well-known  quasi-interpolant  for  con¬ 
structing  projectors  into  spaces  of  continuous  piecewise  linear  functions: 

Qm(f)  =  Qmi.fi  T)  :=  5]  (/,  q>o)<po,  (2.19) 

0e@m 

where  (f,g)  :=  Ir2  fg  and  {(pg)  are  duals  of  {tpg}  defined  by 

T 0  :=  ’  ^A’0’ 

AeTm.AcO 

with  Aa ,0  the  linear  polynomial  which  assumes  values  N  °|A  at  vg  (the  “central  point”  of  6) 
and  —  N  3|A|  at  the  other  two  vertices  of  A  (here  NV0  is  the  valence  of  vg).  Evidently, 

(pOi  To')  =  <W,  9, 9'  £  ©m- 

It  is  easily  seen  that  the  quasi-interpolant  Qm  satisfies  the  following: 

(a)  Qm.  '■  Llfc  — >■  Sm  is  a  linear  operator. 

(b)  Qm  is  a  projector  into  Srn .  i.e.,  Qm(S)  =  S  for  S  G  Sm. 

Other  properties  wrill  be  given  in  the  following. 
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Lemma  2.10.  If  f  G  Ljj'c ,  1  <  rj  <  oo,  and  A  G  Tm,  m  £  Z;  £/ien 

||Qm(/)||^(A)  <  c\\f\\Lrj(nA) 

with  c  =  c(rj,  No,  5). 

Proof.  It  is  readily  seen  that 

\{f,vo) I  <  \\f\\tn(e)\\f>eh  <  c||/||l,(0)||^||oo|^|w  <  c\e\-^\\f\\Lm 
and  \\^Pe\\r]  A  c\9\lIv,  where  1  /rf  :=  1  —  1/rj.  Therefore,  for  every  A  G  Tm , 


||<9m(/)IU,(A)  <  ^  K/’^)||ke||J?  <  c  ^  ||/||l„(0)  <  c 

oeom,ocnA  ee  om,ocnA 

Lemma  2.11.  If  S  G  Sfn,  0  <  rj  <  oo,  and  A  G  Tm,  m  G  Z;  £/ien 

||Qm(*S')|U,(A)  <  c||5|U,(nA) 


Lr)(£l  A  ) 


.  □ 


with  c  =  c(r ),  N0,  S). 

Proof.  If  7]  >  1,  then  the  estimate  follows  by  Lemma  2.10.  Let  0  <  rj  <  1.  We  use  the 
estimate  ||^||^  <  c| 0 1 1/,7? ,  properties  of  LR-triangulations  (§2.1),  and  Lemma  2.7,  (b),  to 
obtain 


IIOn^lkfA)  < 
< 

< 


c  i<5,^)iii^ii^  <  c  n^iiooii5iulW|6ii; 

oe&m,ocn  a  eeom,ecnA 


*  E  i»r1+'iisiu.w<«  E  Gr1+'[|siiL,(A-) 

0eom,ocnA  A'f.'/,.,.  a^A'a 

«  E  IISIIMAO  <  «l|s||Mni).  □ 


A'eTm,  A'cOa 


Local  piecewise  linear  approximation.  For  a  given  /  G  Z/°c,  r]  >  0,  and  A  G  7m,  m  G  Z 
(recall  that  T  is  an  LR-triangulation),  we  define  the  error  of  Z^-approximation  to  /  on  Qa 
from  Sm  by 

Sa(/),:=  SA(/,r)„:=  .inf  ||/-5||MflA).  (2.20) 

5e5m 

Similarly  as  in  the  polynomial  case,  we  say  that  5  G  Sm  is  a  near  best  Z^-approximation 
to  /  on  Qa  from  Sm  with  a  constant  A  if 

Hf  -  SljLv(nA)  <  ASA(f)v. 

Lemma  2.12.  Suppose  0  <  //,  <  r)  and  S  is  a  near  best  L^- approximation  to  f  G  Ln(VL A)  on 
Qa  from  Sm.  Then  S  is  a  near  best  Lv- approximation  to  f  on  12a  from  <Sm. 
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The  proof  of  this  lemma  is  similar  to  the  proof  of  Lemma  3.2  of  [DP]  (see  also  Lemma  2.8 
above).  For  completeness,  we  give  it  in  the  appendix  (§4.2). 

The  quasi-interpolant  (defined  above)  is  a  simple  and  useful  tool  for  constructing  projec¬ 
tors  into  Sm  with  good  localization  properties.  For  rj  >  0  and  f  E  iff .  let  Pa,,  =  PaM) 
be  a  near  best  LJ?(A)-approximation  to  /  from  n2.  Note  that  if  r/  >  1,  then  PA,,(-)  can  be 
realized  as  a  linear  projector  into  the  space  of  linear  polynomials  restricted  on  A.  However, 
Pa,,{-)  is  nonlinear  if  rj  <  1.  Let 

Sm,n(f )  :=  1a  •  PA,v(f )  for  m  E  Z. 

AeTm 

Clearly,  Sm^(f)  E  Sf  and  Sm^(S)  =  S  for  every  S  E  Sfn.  We  set 


TmM)=TmMT)  :=  Qm(SmM ))■ 


(2.21) 


This  construction  is  well-known  and  is  needed  when  working  in  Ln  with  0  <  r)  <  1.  Evidently, 
Pm,, if)  e  Sm  and  Tm,,(S )  =  S  for  S  E  Sm. 

The  next  lemma  establishes  the  good  local  approximation  properties  of  the  operators  Qm 
and  Tm. 

Lemma  2.13.  (a)  If  f  E  Ll°c ,  1  <  t)  <  oo,  and  A  E  Tm,,  rn  >  0,  then 


\\f-Qm(f)\\Lv(A)<cSA(f),.  (2.22) 

(b)  If  f  E  Ll°c,  0  <  rj  <  oo,  and  A  E  Tm,  m  >  0,  then 

\\f-Tm,,(f)\\Lv(A)<C§A(f),.  (2.23) 

The  constants  above  depend  only  on  r]  and  the  parameters  of  T ■ 


Proof.  To  show  that  (2.23)  holds,  we  choose  SA  E  Sm  for  which  Sa if),  is  attained,  i.e., 
11/  -  5A||i^nA)  =  Sa(/V  Then 


\\f-Tm(f)\\Lv(A)  = 


< 

< 

< 

< 


Wf  -  Qm(Sm(f))\\Lnm 
\\f-SA  +  SA-Qm(Sm(m\LviA) 

c\\f  -  s.\\  r.u{A)  +  c\\Qm{SA  -  Sm{f))\\Lv{A} 

cSA(f)v  +  c\\SA-Sm(f)\\Lv(nA) 

mA(f),  +  c||/  -  SA 1 1 L ,( ( f!  a  )  +  C\\f  -  Sm(f)\\Lv(nA) 

c&A{f)„ 


where  we  used  that  Qm{SA )  =  SA  on  A,  Lemma  2.11,  and  the  obvious  inequality  \\f  — 
Sm{f)\\Lv(nA)  <  [| f  ~  S\\  Thus  (2-23)  is  proved. 

The  proof  of  (2.22)  is  similar  and  will  be  omitted.  □ 
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(2.24) 


Lemma  2.14.  (a)  If  f  G  Ll°c,  1  <  r/  <  oo,  then  for  A  G  T 

11/  -  <3m(/)||%(A)  ->■  0  as  m  ->■  oo. 

(b)  If  f  G  Lj;,c ,  0  <  r)  <  oo;  £/ien  /or  A  G  T 

11/  -  rm(/)||^(A)  ^  0  as  m  ->■  oo.  (2.25) 

Proof.  Using  (2.1)  and  simple  geometric  arguments,  one  can  show  that  if  e  is  an  edge  of  a 
descendant  of  A,  and  e  does  not  emanate  from  a  vertex  of  A,  then  |e|  <  (1  —  r)diam(A).  By 
condition  (g)  on  LR-triangulations  (§2.1),  at  any  given  level  there  can  be  at  most  3 N0  edges 
starting  from  the  vertices  of  A.  From  conditions  (c)  and  (g),  it  follows  that  every  edge  e  is 
subdivided  within  less  than  2iV0  levels  after  its  first  appearance,  and  by  (2.1)  each  of  the 
pieces  of  e  has  length  <  (1  —  r)diam(A).  Combining  the  above  observations,  we  conclude 
that  after  less  than  61Vq  levels  of  refinement  all  edges  of  descendants  of  A  will  have  lengths 
<  (1  —  r)diam(A).  From  this  we  derive  that 

max{diam(A')  :  A'  G  %n  .  A'  C  11a}  — ?•  0  as  m  — >■  oo. 

Hence,  \\f  -  Sm{f) ||l„(oa)  ->■  0  and  \\f  -  Sm{f) ||l„(qa)  ^  0  as  m  ->■  oo,  where  Sm(f)  is  a 
(the)  best  L^-approximation  to  /  on  Qa  from  Sm.  Therefore, 

11/  -  Tm(f) IU.CA)  <  c|| /  -  Sm(f) |U,)(A)  +  c\\Qm(Sm(f)  -  Sm(f)) ||MA) 

<  C\\f-  Sm(f) |U„(A)  +  c\\Sm(f)  -  Sm(f) ||^nA) 

<  c\\f  -  Sm{f)\\Lv(nA)  +  c\\f  -  Sm(f)kn(nA)  0 

as  m  — >•  oo,  where  we  used  that  Qm{S a)  =  5a  on  A  and  Lemma  2.11.  Thus  (2.25)  is  proved. 
The  proof  of  (2.24)  is  similar.  □ 

2.3  Slim  B-spaces 

In  this  section,  we  introduce  a  collection  of  smoothness  spaces  (B-spaces)  which  we  later 
used  for  characterization  of  nonlinear  n-term  Courant  element  approximation.  Throughout 
the  section,  we  assume  that  T  is  an  arbitrary  locally  regular  triangulation  of  M2  (see  §2.1). 
The  B-spaces  will  depend  on  T-  This  dependence  may  or  may  not  be  indicated  explicitly. 

Definition  of  slim  B-spaces  via  local  approximation.  We  define  the  slim  B-space 
B^T),  a  >  0,  0  <  p,  q  <  oo,  as  the  set  of  all  /  G  LP(R2)  such  that 

H/llB&mHI/llp+(£[2”“(  E  Si(/)j)1/'],)1/,<oo,  (2.26) 

me z  AeT,  2~m  <|  A|<2_m+1 

where  Sa {f)q  ■=  §a(/,  T)q,  for  A  G  Tm ,  denotes  the  error  of  Z^-approximation  to  /  on  0A 
from  Sm  (see  (2.20)),  and  the  ^-norrn  is  replaced  by  the  sup-norm  if  q  =  oo. 

We  shall  further  study  only  a  specific  class  of  slim  B-spaces  which  are  exactly  the  smooth¬ 
ness  spaces  needed  for  nonlinear  Courant  Lp-approximation  (see  §3.2).  We  assume  that 
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0  <  p  <  oo  and  a  >  0,  and  define  r  by  the  identity  1/r  :=  a  +  1/p.  We  shall  need  the  slim 
B-space  £?“  :=  R“(T),  which  is  a  slightly  modified  version  of  the  space  R“t(T)  from  above. 
We  define  B°(T)  as  the  set  of  all  functions  /  G  LP(R 2)  (in  place  of  /  G  Z/r(R2))  such  that 

11/11*?  =  II/IIbsit)  :=  (  £(|A|- "Sa(/)r)")1/T  <  oo.  (2.27) 

a  er 

Remark.  In  the  above  definition,  the  condition  /  G  LP(R2)  is  not  restrictive  since  R“(T) 
is  embedded  in  (see  Theorems  2.15-2.16  below).  Its  only  role  is  to  eliminate  a  possible 
component  S ^  of  /,  which  is  a  piecewise  polynomial  on  infinite  triangles  G  71oo  (see 
Lemma  2.1).  This  condition  can  be  replaced,  e.g.,  by  the  condition:  |{a>  :  \f(x)\  >  s}|  <  oo 
for  each  s  >  0  (see  Theorem  2.15  below).  It  also  can  be  replaced  by  the  condition  /  G  LT(R 2) 
as  in  the  definition  of  R“r(T)  (see  (2.26)),  which  is  a  little  bit  restrictive  since  the  spaces 
LP(R 2)  and  LT(M2)  (r  ^  p)  are  not  embedded  into  one  another.  However,  this  condition 
is  not  too  restrictive  since  our  approximation  tool  in  §3.2  consists  of  compactly  supported 
piecewise  polynomials  and  hence  all  theorems  from  §3.2  would  hold  if  it  is  used. 

Evidently, 

\\f  +  g\\rB?  <  \\f\\TB?  +  Mb?,  r*  :=  min{r,  1}. 

Also,  if  ||/||bq  =  0,  then  Sa(/)t  =  0  for  each  A  G  T-  From  this,  it  readily  follows  that  / 
coincides  with  a  linear  polynomial  on  each  Aoo  G  71  oo-  Therefore,  using  that  /  G  Lp,  we 
infer  that  /  =  0  a.e.  Thus,  for  a  fixed  LR-triangulation  T,  ||  •  || b?(t)  a  norm  if  r  >  1  and 
a  quasi-norm  if  r  <  1.  In  the  following  “norm”  will  stand  for  “norm”  or  “quasi-norm”. 

We  next  introduce  other  equivalent  norms  in  B®(T)  which  will  enable  us  to  operate  more 
freely  with  B-spaces.  For  /  G  l7°c(M2),  7)  >  0,  we  define 

X3,„(/):=iVs.,,(/,r)  :=  (  ^(|A|-"+1/t-1/’>Sa(/),)t)1/T 

a  er 

=  (Y/ml--ll”SMU)T)1,T ,  (2.28) 

a  er 

where  we  used  that  1/r  =  a  +  1/p.  Clearly,  iVgiT(/)  =  ||/||Ba. 

Atomic  decomposition  of  £?“( 7~).  For  /  G  Lp(R2),  we  define 

N*(f)  =  NMT)  :=  inf  (E(l^niwl|r)T)1/T,  (2.29) 

where  the  infimum  is  taken  over  all  representations  /  =  co^o  with  convergence  in 

Lp(  A)  for  each  A  G  T.  (The  existence  of  such  representations  of  /  follows  by  Lemma  2.14.) 
As  will  be  seen  in  the  proof  of  Theorem  2.15  below 

Ew  ■“||ce^||T)T  <  oo  implies  ||  ^  |ce^(-)|||  <  oo 
dee  oee  p 
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and  hence  Xloe©  \cbVb{')\  converges  a.e.  and  unconditionally  in  LP(R2).  Therefore,  the  order 
of  the  terms  in  the  series  above  is  not  essential.  By  Lemma  2.7,  it  follows  that 

NM)*f  inf  (Ellc^ll;)1/T-  (2-30) 

Definition  of  norms  in  B^(‘T)  via  projectors.  We  define,  for  m  E  Z, 

Qm  ■ —  Qm  Qm—i  and  tm ji  . —  Tm ^  Tm— 1,77 -  (2.31) 

For  a  given  function  /  6  Lj)'c(R2),  1  <  r)  <  00,  clearly  qm(f )  £  Sm  and  we  define  uniquely 
the  sequence  {bg{f)}geem  (m  £  from  the  expression 

QmU)  =■■  Y  WW  (2.32) 

Oe&ra 


Also,  if  /  e  L^M2), 
by 


0  <  t)  <  00,  then  E  Sm.  In  this  case,  we  define  {bo,v{f)}oeem 

)  =■  Y  (2.33) 


ee&r, 


Evidently,  {7>6» ( ' ) }  and  {b0,vi')}  with  7j  >  1  are  linear  functionals,  while  {V ??(•)}  are  nonlinear 
if  0  <  rj  <  1. 

We  define 

NqAD  =  Nq,t(1,T)  :  =  (^(l#H|Vr(/)^IWT)1/T,  (2.34) 

Bee 

where  bg,T(f)  :=  bg(f)  are  from  (2.32)  (or  from  (2.33))  if  r  >  1  and  bg,T(f)  are  from  (2.33)  if 
r  <  1.  More  generally,  we  define 


AW/)  =  AW  AT)  :=  (E(I»I1/^1/1V,(/)^II,)t)1/T.  (2.35) 

eee 

By  Lemma  2.9,  we  have 

AW/)“  (£(lAl1/’"1/lWnllMA>)T)1/T.  if  r)  >  1,  (2.36) 

a  eT 

XqM)  “  (  E(lA|1/,'_1/’'llf^(/)ll«Al)T)1/T.  if  0  <  >)  <  1,  (2-37) 

a  er 

and,  in  both  cases, 

aw/)  ~  ( E 11  w/wi;) I/T.  (2.38) 

Bee 

Our  next  step  is  to  show  that  the  slim  B-space  B“(T)  is  embedded  in  Lp( M2).  To  do 
this,  we  invoke  Theorem  3.3,  proved  later  in  §3.1,  which  is  however  completely  independent 
of  this  section,  and  can  therefore  safely  be  used. 
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Theorem  2.15.  If  \{x  :  \f(x)\  >  s}|  <  oo  for  each  s  >  0  and  NQ^(f,T)  <  oo  for  some 
0  <  rj  <  oo,  then  f  G  LP(R2), 


f  =  E  bg^(f)ipg  absolutely  a.e.  on 


ee& 


and  unconditionally  in  Lp(R2),  and 


p  — 


^\be,v{f)Ve{-)\  <cNQttl(f) 


oee 


with  c  depending  only  on  a,  p,  g,  and  the  parameters  of  T ■ 


(2.39) 


(2.40) 


Remark.  Observe  that  the  condition:  \{x  :  \f{x)\  >  s}|  <  oo  for  each  s  >  0  is  satisfied  if 
/  G  Lq(R2)  for  an  arbitrary  q  <  oo. 

Proof.  Let  us  consider  the  case  when  NQtV(f)  is  defined  via  the  coefficients  bg^(f)  from 
(2.33).  We  introduce  the  following  abbreviated  notation:  Tm  :=  Tm^(f),  trn  :=  t.m^(f), 
bo  ■■=  bg^(f),  and  N{f)  :=  (£)fle0  \\bo^o\\Tp)1/r  ■  Note  that  NQ<n{f)  «  N{f),  by  (2.38).  Since 
T  is  an  LR-triangulation,  the  sequence  {$,„}  :=  {bgpg}geQ  satisfies  requirements  (i)-(ii)  of 
the  general  embedding  Theorem  3.3  below.  Therefore,  J2oee  I bgipe(-)\  <  oo  a.e.  on  M2  and 


J2\beM-)\ 

oee 


<  cN{f). 


(2.41) 


Hence 


and 


|tj(-)|  <  oo  a.e.  on  M2 


j'GZ 


<  cN(f )  <  oo. 

p 


(2.42) 

(2.43) 


Evidently,  (2.39)  and  (2.41)  imply  (2.40).  Therefore,  it  suffices  to  prove  that  (2.39)  holds. 
To  this  end,  we  first  show  that 


f  =  T0  +  ^^t.j  absolutely  a.e.  on  l2.  (2.44) 

j= i 


Set  g  :=  T0  +  i  0  pointwise.  By  (2.42),  it  follows  that  g  is  well  defined.  Clearly, 
9  =  Tm  +  Y^jLm+itj  a-e-  for  m  e  Hence,  by  (2.43), 


11#  -  Tm\\p  < 


E  lo(-) 

j  =m+ 1 


->•  0 
p 


as 


m  — >■  oo. 


(2.45) 


On  the  other  hand,  /  G  i/°c(M2)  and  by  Lemma  2.14  we  have,  for  A  G  T, 

11/  -  Tm\\Lv(A]  -)0  as  m  -»■  oo. 
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From  this  and  (2.45),  it  follows  that  g  =  f  a.e.  and  hence  (2.44)  holds. 

We  shall  next  prove  that  for  every  Aoo  G  7LX  (see  Lemma  2.1)  there  exists  a  unique 
linear  polynomial  Pa <*,  such  that 

o 

T0  —  PAoo  =  tj  absolutely  a.e.  on  A^.  (2.46) 

j=- oo 

Using  Lemma  2.7,  we  have  that  for  any  A  G  Tj  (j  G  Z) 

IlfjllwA)  <  c|A|-p||^||MA)  <  c|A|-p  \\bWo\\p  -  c\A\~pN{f).  (2.47) 

0e&j'.Ac6 


Since  T  is  an  LR-regular  triangulation,  if  A  C  A',  A  G  7*,  and  A'  G  7),  then  |  A  |  <  pk~j  |A'|, 
where  0  <  p  <  1  is  the  parameter  of  T  from  (2.1).  Using  this  and  (2.47),  we  obtain,  for 
A  G  7,  1  G  Z, 


k  k 

Y  l|ijllt„(A)  <  cAT(/)|A|-V»  £  pC*-i)/p  <  c|A|-'/-iV(/)  <  do.  (2.48) 

j— — oo  j=—oc 

For  A  G  7,  we  set  Pa  :=  7).  x  0  pointwise.  By  (2.42),  the  series  converges  absolutely 

a.e.  and,  therefore,  Pa  is  well-defined.  Clearly,  Pa  =  Tm  —  ^  /  ;  for  m  <  k  and,  hence, 

by  (2.48), 


\\Tm  Pa||loo(a)  A  lL 


< 


j=-oo 


^oo(A) 


E 


j=-oo 


h  I  Too  (a) 


— >■  0  as  m  — >■  — oo. 


(2.49) 


Since  all  A/s,  j  <  Aq  are  linear  polynomials  on  A  G  7,  so  is  Pa-  Moreover,  PA  is  the 
same  polynomial  for  all  A  G  T  contained  in  a  fixed  A^  G  T-oo .  Indeed,  let  A',  A"  G  T, 
ALA"  C  Aoo  (A/  and  A"  are  possibly  from  different  levels).  Since  A^  is  an  infinite  union 
of  nested  triangles,  there  exists  A  G  T  such  that  ALA"  C  Ac  A^.  By  (2.49), 


II Tm  ~  Pa'IIloo(a')  -)■  o  and  ||Tm  -  Pa||l00(a')  0  as  m  ->■  -oo. 

Hence  PA'  =  PA.  Similarly,  PA"  =  PA.  Therefore,  there  exists  a  unique  linear  polynomial 
PAoo  such  that  (2.46)  holds. 

Combining  (2.44)  with  (2.46),  we  obtain 


Pa^  =  absolutely  a.e.  on  A^,  A^  G  Too- 


(2.50) 


Using  that  tj  G  Tp(M2)  and  the  hypothesis  of  the  theorem,  we  obtain 

\{x  €  Aoo  :  | TRaoo ( ^’ ) I  >  s}|  <  |{z  :  \f{x)\  >  s/2} |  +  \{x  :  |  ^A/:r)|  >  s/2} | 


jez 


< 


|{;C:|/W|>S/2}|  +  (V2r<'  E‘4 

jf£  Z 


<  oo, 


19 


for  each  s  >  0.  Since  A,*,  is  an  infinite  triangle  or  a  half  plane  or  M2  and  Pa^  is  a  polynomial, 
this  is  only  possible  whenever  Pa^  =  0.  Thus  (2.39)  is  established. 

The  proof  of  the  theorem  when  AqiJ?(/)  is  defined  via  the  coefficients  bo^if)  :=  bg(f) 
from  (2.32)  is  the  same  and  will  be  omitted.  □ 

Theorem  2.16.  For  f  G  Bf(T),  the  norms  \\f\\B?(T),  Ns,v{f )  (0  <  77  <  p),  N^{f),  and 
NQ^f)  (0  <  r)  <  p),  defined  in  (2.27)-(2.29)  and  (2.35)  are  equivalent  with  constants  of 
equivalence  depending  only  on  p,  a,  r),  and  the  parameters  of  T ■ 

Proof.  By  (2.30),  (2.38),  and  Theorem  2.15,  it  follows  that 

NM)  <  cNqM)-  (2-5i) 

Clearly,  if  A  G  Tm  and  A'  is  the  (unique)  parent  of  A  in  Tm_  1,  we  have 

\\tm,r)(f)  lU-RA)  A  c  ||/  —  Tmtn(f)  llxRA)  +  c  \  \f  ~  Tm-ljgif)  ||l^(A,| 

<  cSa(/)jj  +  cSa'(/)j?, 

where  we  used  (2.23).  A  similar  estimate  holds  for  \\qm{f)\\Lv(A),  using  (2.22).  These  imply 

NqM)  <  cNsM-  (2.52) 

We  next  prove  that  if  N$(f)  <  00,  then 

AM/)  -  cAW)  for  0  <  h<P-  (2-53) 

By  Holder’s  inequality,  it  follows  that 

AM/)  <  NsAfh  <>  <  /'  <  r. 

Thus  it  suffices  to  prove  (2.53)  only  for  r  <  p  <  p. 

Suppose  /  G  Lp  and  N$(f)  <  00.  Let  /  =  Y2ffe&C0iP0  be  an  arbitrary  representation  of 
/,  where  the  convergence  is  in  Lp( A)  for  every  A.  Recall  that 

Xs.„(/):=(^(|A|J-Jsi(/)„r)',  (2.54) 

ACT 


where  Sa (f)n  is  defined  in  (2.20).  Evidently,  Sa (g)n  =  0  for  A  G  Tm  if  g  G  Sm ,  and 
§a (9)n  <  I  !l\  Now,  fix  A  e  Tn,  n  G  Z.  Using  the  above  properties  of  Sa (g)n  and 

Theorem  3.3  with  {$TO}  :=  {cgpg  :  9  G  0,0  C  (for  the  definition  of  see  (2.10)),  we 

obtain 


Sa(/)J  = 


< 


< 


OO 

sa(  J2ce^°),  - 

j=n-\- 1  O^Qj 


j=n+ 1  //•  (•) 


L„{n  A) 


I]  S 


<  C  ^  llwll^ 

0eo,ocn2A 


C  \°\T( ''  ^ll^flllr* 

ee&,ecn2A 
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where  for  the  latter  inequality  we  used  that  \\po\\q  ~  0  <  q  <  oo.  Substituting  the 

above  estimate  in  (2.54),  we  get 

^M/r  <  c^|ait(^_^  y 

a  er  oee,ocn2A 

<  CY  Y1  {\9\/\^\y{^\\9\^\\ce^e\\rY  (2.55) 

a  er  ee&,ocn2A 

<  cYm^WceMrY  Y  d0i/iAi)T("~^’ 

oee  AeT-.ecn2A 

where  we  once  switched  the  order  of  summation.  By  condition  (g)  on  LR-triangulations 
(§2.1),  we  have,  for  9  G  Qj, 

#{A  G  Tj  ;  9  c  Oi)  <  c(iV0)„ 

and  by  (2.1  )-(2.2),  |#|  <  c(Ar0,  <5)/>7|A|,  if  0  C  Q2A  with  A  G  Tm-j  and  9  G  0TO.  Hence,  for 
9ee\ 

OO 

'Y/  {\9\/\A\)TY~Y  <  YTY~Y  <  c  <  oo,  (2.56) 

AeT-.0cn2A  j= o 

where  we  used  that  p  <  1  and  p,  <  p.  Finally,  combining  (2.56)  with  (2.55),  we  obtain 

n?,mv  <  e£(ir"iiw.n,r, 

eee 

which  implies  (2.53).  Evidently,  (2.51)-(2.53)  imply  the  theorem.  □ 

Remark.  The  following  simple  example  shows  that,  in  general,  Theorem  2.16  is  not  valid 
for  7]  >  p.  Let  f  :=  (fg  for  some  9  G  0.  It  is  not  hard  to  see  that  ||/[|_Ba(r)  m1,r  1 1  Lp Q  1 5 
while  N§tV(f,  T)  =  oo,  if?/  >  p.  Therefore,  iV§^(/,  T)  is  not  equivalent  to  ||/[|b“(t1  if  V  A  [>■ 

2.4  Skinny  B-spaces 

In  this  section,  we  define  a  second  family  of  B-spaces  which  we  shall  use  in  §3.3  for  the 
characterization  of  nonlinear  (discontinuous)  piecewise  polynomial  approximation  generated 
by  nested  triangulations. 

Throughout  this  section,  we  assume  that  T  is  an  arbitrary  weak  locally  regular  triangu¬ 
lation  of  R2  (see  §2.1).  We  define  the  skinny  B-space  £>))*' ( T )'-  ol  >  0,  0  <  p,  q  <  oo,  k  >  1, 
as  the  set  of  all  /  G  LP(R2)  such  that 

ll/llssfm-  ll/ll W(£[2”“(  E  ^(/.AI^yAoo,  (2.57) 

m£Z  AeT,  2_m  <|  A|<2_m+1 

where  uJk(f,A)p  is  the  local  modulus  of  smoothness  of  /,  defined  in  (2.8). 

As  for  the  slim  B-spaces,  we  shall  explore  in  more  details  only  the  skinny  B-spaces  that 
are  needed  in  nonlinear  piecewise  polynomial  Lp-approxi mati on .  Suppose  0  <  p  <  oo,  a  >  0, 
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k  >  1,  and  let  1/r  :=  a  +  1/p.  We  shall  need  the  skinny  B-space  which  is  a  slight 

modification  of  B®k(T)  from  above,  and  is  defined  as  the  set  of  all  /  G  LP(R 2)  (in  place  of 
/  G  Lr(M2))  such  that 

ll/ll**  =  ll/ils^CD  :=  (^(|A|-”Wt(/,A)rr)i/T  <oo.  (2.58) 

a  eT 

Whitney’s  estimate  (Lemma  2.6)  implies 

ll/ils-m  ~  (  £(|A|-“Et(/,  A)Tr)1/T,  (2,59) 

a  eT 

where  Ek(f,  A)q  is  the  error  of  /^-approximation  to  /  on  A  from  Lb  (see  (2.7)). 

If  ||/||Ba*  =  0,  then  Ek(f,A)r  =  0  for  each  A  G  T-  From  this,  it  readily  follows  that 
/  =  lAoo  •  -Paoo  (PAoo  G  11*.)  on  each  A^  G  71  oo-  Therefore,  using  that  /  G  Lp,  we  infer  that 
/  =  0  a.e.  Thus,  ||  •  || s«k(T)  is  a  norm  if  r  >  1  and  a  quasi-norm  if  r  <  1. 

Remark.  The  only  difference  between  skinny  B-spaces  and  slim  B-spaces  is  that  the  local 
approximation  from  continuous  piecewise  linear  functions  on  sets  0A.  A  G  T,  is  replaced  by 
local  polynomial  approximation  on  triangles  from  71  The  key  is  that  the  triangles  from  T 
form  a  tree  with  respect  to  the  inclusion  relation,  while  the  sets  1/a,  A  G  T  do  not  form 
a  tree;  they  overlap  more  significantly.  This  fact  allows  for  developing  the  theory  of  the 
skinny  B-spaces  and  their  application  to  nonlinear  (discontinuous)  piecewise  approximation 
(see  §3.3)  under  less  restrictive  conditions  on  the  triangulations,  namely,  for  weak  locally 
regular  triangulations. 

Next,  we  introduce  two  other  equivalent  “norms”  in  B^k(T).  For  /  G  Z/°C(R2),  ?/  >  0, 
we  define 

A r„,„(/,  T)  :=  (^(lAr+i-W/.A),)1)'  (2.60) 

a  eT 

=  ( £<|A|$-W/,  A),r)'  *  (  £(|a|G^(/,  A)„r) 

a  eT  a eT 

where  we  used  that  1/r  =  a  +  1/p.  Clearly,  Afw,T{f,T)  =  ||/||b«*(T)- 

For  each  A  G  T  and  tj  >  0,  we  let  Pa,^/)  be  a  near  best  //^(^-approximation  to  / 
from  Lb  with  a  constant  A  which  is  the  same  for  all  A  G  T  (see  (2.16)).  Note  that  if  7)>  1, 
then  Pa,??(/)  can  be  realized  as  a  linear  projector  into  the  space  of  polynomials  of  degree 
<  k  restricted  on  A.  Let  Pmfl(/ )  :=  Y^AeTm  1a  •  Pa,jj(/)-  Clearly,  PTOfl(/)  is  a  near  best 
//^-approximation  to  /  from  S/n ( T )  and  a  projector  into  S/n ( P ) .  We  define 

PmM)  ■=  PmM ,  T)  :=  PmM)  ~  Pm-ljf)  *  ^(T),  (2.61) 

and  set  pA^if)  ■=  1a  •  Pm,r,(f)  for  A  G  Tm,  We  define 

Vp,,(/,T)  :=  (  ^(|A|1/"-W||pa,„(/)||„)b/T.  (2.62) 

a  er 
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Using  Lemma  2.7,  we  obtain 


M>,,(/,T)»  (^(|A|-»||pA,,(/)||Tr)1/T*  (^||Pi,„(/)||;)1/T.  (2.63) 

a eT  AeT 

The  following  embedding  theorem  is  pivotal  for  our  theory  of  nonlinear  piecewise  poly¬ 
nomial  approximation. 

Theorem  2.17.  If  \{x  :  \f{x)\  >  s}|  <  oo  for  each  s  >  0  and  AfpyTI{f,  T)  <  oo  (0  <  p  <  oo), 
then  f  €  Lp(R2), 

f  =  Pm,r){f)  absolutely  a. e.  otil2  (2.64) 

mGZ 

and  unconditionally  in  Lp,  and 

ll/llp  <  |  E  IjW/>|||  <  cMP„U,T)  (2.66) 

mGZ  P 

with  c,  depending  only  on  a,  k,  p,  p,  and  the  parameters  of  T ■ 

Proof.  Since  T  is  a  WLR-triangulation,  the  sequence  {$„>}  :=  {pa,jj(/)}a£T  satisfies 
requirements  (i)-(ii)  of  Theorem  3.3  below.  Therefore, 

I  Ipa,„(/)|  <  c(  WpaMM)  '  ~  cMpMT)  <  OO.  (2.66) 

AeT  p  AeT 

From  this,  similarly  as  in  the  proof  of  Theorem  2.15,  it  follows  that  for  every  Aoo  6  T-oo 
(see  Lemma  2.1)  there  exists  a  polynomial  Pa^  €  LR  such  that 

/  -  Pa.  =  ^Pnwif)  absolutely  a.e.  on  A*. 

me  z 

Using  that  \{x  :  \f{x)\  >  s}|  <  oo  for  s  >  0  and  (2.66),  we  infer  Pa ^  =  0  and  the  theorem 
follows.  □ 

We  next  give  the  equivalence  of  the  skinny  B-norms  introduced  above. 

Theorem  2.18.  For  each  f  G  Bfk(T),  the  norms  ||/||b«*(tp  AfWtr){f,  T)  (0  <  p  <  p),  and 
Npji ( f  i  T)  (0  <  p  <  p)  are  equivalent  with  constants  of  equivalence  depending  only  on  a,  k, 
p,  p,  and  the  parameters  of  T ■ 

Proof.  The  proof  of  this  theorem  is  similar  to  (but  easier  than)  the  one  of  Theorem  2.16 
and  will  be  omitted.  The  difference  is  that  the  role  of  Sa {f)p  is  now  played  by  cj^(/,  A)^. 
See  also  the  proof  of  Theorem  2.20  below.  □ 

Remark.  The  following  simple  example  shows  that,  in  general,  A/"w^(/,  T)  is  not  equivalent 
t°  ||/||s“fc(T)  ^  V  ^  Let  f  —  1a  for  some  /X  £  /~T’.  It  is  ertsily  seen  t  licit  ~ 

IaIVp  =  ll/IU  while  A fu,n(f,T)  =  oo  if  p  >  p. 
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2.5  Fat  B-spaces:  The  link  to  Besov  spaces 

Throughout  this  section,  we  assume  that  T  is  an  arbitrary  strong  locally  regular  triangulation 
of  M2  (§2.1).  We  define  the  fat  B-space  Mpk(T),  a  >  0,  0  <  p,  q  <  oo,  k  >  1,  as  the  set  of  all 
/  G  Lp(R 2)  such  that 


!(T) 


+ 


(E 


VPl  «\  1  /q 


r  /  v  \  VP’  ?\ 

2m“(  E  ]  ) 

ACT,  2_m  <|  A|<2_m+1 


<  oo, 


where  Qa  is  defined  in  (2.9). 

As  in  the  previous  sections,  we  shall  focus  our  attention  only  on  the  scale  of  fat  B-spaces 
which  naturally  occur  in  nonlinear  approximation,  namely,  the  spaces  B“fc(T),  where  a  >  0, 
&>l,0<p<oo,  and  1/r  :=  a  +  l/p.  We  define  the  space  Mfk  (T)  as  the  set  of  all  functions 
/  G  Lp(R 2)  such  that 

II/IIe^CT)  :=  (  ^(|ArQ^(/,fiA)r)T)  7  <  oo,  (2.67) 

A  eT 

which  is  a  modification  of  the  space  Mfk  (T)  from  above.  By  Whitney’s  inequality  (Lemma  2.6), 
we  have 

ll/lk*m“  (E(lAr“V(/,SA)T)T)1/T, 

A  eT 

where  £&(/,  H a)t  is  the  error  of  LT-approximation  to  /  on  Ha  from  II  (see  (2.7)). 

Note  that  the  use  of  HA  in  the  definition  of  ||/||ip*(t)  is  not  crucial.  It  is  almost  obvious 
that,  for  instance, 

ii/Mr)«(E<i9rv*(/,fV)T)1/T. 

eee 

It  is  critical,  however,  that  the  neighboring  sets  in  the  collections  {Ha}agt  or  {Q}ee@  overlap 
significantly.  This  makes  the  difference  between  the  fat  and  skinny  B-norms. 

Clearly,  for  /  G  Lr(M2)  and  A  G  T,  we  have  the  inequalities  E2(f,  A)T  <  Sa (/,  T)t  < 
E2(f,  Ha)t,  which  yield  the  following  comparison  theorem. 

Theorem  2.19.  We  have 

II/IIkwt)  < 

and 

||/||a?2(T)  <  c|  ,/'|  /;«(•/  :  <  c\\f\\B^(T). 

We  next  introduce  another  norm  in  B fk(T).  For  /  G  Lj('K(R2),  r)  >  0,  we  define 

j_  1_ 

N«,(/,T):=  (E(IaI'_A(/Aa),)t)*~  (E(IAI'^V(/,S2a)„)t)7.  (2.68) 

a eT  a eT 

Evidently,  Nw,t(/,T)  =  ||/||u»*(r)- 
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To  prove  the  equivalence  of  ||/||bp*(t)  and  Nw^(/,  T)  for  0  <  r)  <  p,  we  need  to  introduce 
one  more  norm  in  Wfi  (T).  For  every  A  G  T,  we  let  Pa,??(/)  be  a  near  best  Z^-approximation 
to  /  on  Oa  from  n*.  with  a  constant  A  which  is  the  same  for  all  Oa,  A  G  T  (see  (2.16)).  We 
define 

P^(/):=P™,,(/,T):=  ^  1a -Pa  ,„(/) 

A  ETm 


tta ,„(/)  '■=  !nA  •  (Pm+i,r,(/)  -  Pa, „(/)),  if  A  G  7^. 
The  new  norm  is  defined  by 


N„,„(/,T):=  (£(  |A| 


1  //'  I  '/  I 


,(/)ii,r) 


(2.69) 


Clearly,  since  T  is  an  SLR-triangulation, 


N^(/,T)  «  (  ^(|A|-a||7TA,IJ(/)||r)T)  (Ell^(/)llpT) 


Theorem  2.20.  For  f  G  ®“fc(T),  l/<e  norms  H/llsp'q-rp  NUfl(/,T)  (0  <  77  <  p),  mid 
N,r^(/,  T)  (0  <  ?7  <  p);  defined  in  (2.67)-(2.69)  are  equivalent  with  constants  of  equivalence 
depending  only  on  a,  p,  k,  rj,  and  the  parameters  of  T . 

Proof.  Using  Holder’s  inequality  and  the  properties  of  the  SLR-triangulations,  we  readily 
obtain 

IW/,T)  <  cN^(/,T),  0  <  r)  <  fx.  (2.70) 

As  we  pointed  out  earlier,  N W,T(/,  T)  =  1 1  / 1 |aq*  (r)  •  Therefore,  it  suffices  to  show  that 

FW  ( /,  T)  «  ( /,  T) ,  for  all  0  <  /i,r)<p. 

From  the  definition  of  Pa,»?(/)  and  7Ta fl(/),  it  follows  that  for  any  A'  G  Tm 

||7TA',r?(/)  ||t,  A  C||/  —  PTO+l,7)(/)||nfl(0Ai)  +  *||/  —  Pa' ,??(/)  ||l,,(Oa/) 

<  C  II  /  Pa,?)  (/)  ||l^(A)  T  cEk  (/,  12 A'  )r/ 

AeTm+i,  acoa/ 

<  e  Ek{f,nA)v  +  cEk ( /,  nA'  )„  • 

Aerm+i,  acoa/ 

Substituting  this  estimate  in  the  definition  of  ( /,  T)  in  (2.69),  we  easily  obtain 


N^(/,T)<cN^(/,T),  r/  >  0. 


We  next  prove  that  if  N^fl(/,  T)  <  oo,  rj  >  0,  then 


bW (/•  T)  <  c (/,  T),  r  <  n  <  p. 


(2.71) 


(2.72) 


Evidently,  (2.70)-(2.72)  yield  the  theorem. 
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We  introduce  the  following  abbreviated  notation:  Pa  :=  Pa ,??(/),  Pm  :=  and 

vta  :=  tta, ,,(/).  We  also  set  pm  :  =  PTO+i  -  PTO  and  pA  :=  1a  •  pm  =  1a  •  tta  for  A  G  Tm, 
Evidently,  ||pa||p  <  IKaIIp,  and,  lienee, 

( E  imi;)1/t  <  ( E  ii^ii;)i/t  ~  tw/.-n  <  oo. 

Aer  Aer 


It  is  readily  seen  that  the  sequence  {'I’m}  :=  {pa}agt  satisfies  conditions  (i)-(ii)  of  Theo¬ 
rem  3.3.  Therefore,  YIaet  IPa(-)|  <  00  a-e-  011  P2>  and 


E  I P- 

ACT 


A  l 


<cN^(/,T). 


(2.73) 


On  the  other  hand,  since  /  G  Lj70C(M2),  \\f  —  PTO||it?  (A)  —>  0  as  m  — >■  oo  for  every  A  G  T. 
Exactly  as  in  the  proof  of  Theorem  2.15,  it  follows  that  /  —  PTO+i  G  Lp( R2)  and 

OO 

f  -^m+l  E  Pj  absolutely  a.e.  on  R2  (2.74) 

j=m+ 1 

and  unconditionally  in  Lp(R2).  Now,  fix  A7  G  Tn,  n  £  27  Since  Pa'  is  a  polynomial  of  degree 
<  k  on  Oa'-  we  have 


&A')n  —  ^k{f  —  Pag &a’)h  <  c||/  —  Pa' ||L^(nA,)-  (2.75) 

Using  (2.75),  (2.74),  and  Theorem  3.3  with  {$,„}  :=  {pA  :  A  G  T,  A  C  0A'},  we  obtain 


Mk(f,^ A')p  <  c||Pn+i  -  PA'||lp(nA,)  +  c  T: 


j=n+l 


<«imi;  +  cE  |  E  Pa 1 1  <  c||7tA'| 

J=n+1  AgTj,AcOa/  /J' 

<C  ^  ||vrA||;<c  ^  |A 


E 


?A 


Act,  AcnA, 


7TA 


T 

n> 


AeT,  AcOa, 


AeT,AcftA, 


where  we  used  Lemma  2.7  and  the  properties  of  the  SLR-triangulations.  Substituting  the 
above  estimate  in  the  definition  of  T),  we  proceed  as  in  the  proof  of  Theorem  2.16, 

to  obtain  (2.72).  □ 


Comparison  of  regular  B-spaces  with  Besov  spaces.  The  Besov  space  Bq(Lp )  = 
Bsq(Lp( R2)),  s  >  0,  1  <  p,  q  <  oo,  is  usually  defined  as  the  set  of  all  functions  /  G  LP(R2) 
such  that 

a 00  r/t\  Ve 

{t,  sujk{f,t,)p)qj)  <  oo  (2.76) 

with  the  Lq-norm  replaced  by  the  sup-norm  if  q  =  oo,  where  k  :=  [s]  +  1  and  cok(fJ.)p  is  the 
A:-th  modulus  of  smoothness  of  /  in  Lp(R2),  i.e.,  iOk{fA)p  :=  sup^^  ||A|(/,  -)||p.  The  norm 
in  Bq(Lp )  is  defined  by 

ll/lk?(c)  :=  II/IIp  +  l/ls|(;4>)- 
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It  is  well-known  that  if  k  in  (2.76)  is  replaced  by  any  other  integer  >  s ,  then  the  resulting- 
space  would  be  the  same  with  an  equivalent  norm.  However,  the  situation  is  different  when 
p  <  1  (see  [PI]).  For  this  reason  we  introduce  k  as  an  independent  parameter  of  the  Besov 
spaces  in  the  next  definition. 

In  this  article,  we  are  interested  in  nonlinear  piecewise  polynomial  (spline)  approximation 
in  LP(M2)  (0  <  p  <  oo).  The  Besov  spaces  B/a(Lr)  with  a  >  0  and  1/t  :=  a  +  1/p  play 
a  distinctive  role  in  this  theory.  Taking  into  account  that  B/a(Lr)  is  embedded  in  Lp  and 
the  above  observation  regarding  the  independence  of  k  and  the  smoothness  parameter,  we 
naturally  arrive  at  the  following  slightly  modified  version  of  the  Besov  space  B/a(Lr): 

Assuming  that  0  <  p  <  oo,  a  >  0,  k  >  1,  and  1/r  :=  a  +  1/p,  we  define  the  Besov  space 
B/a,k(LT)  as  the  set  of  all  functions  /  G  Lp(R2)  (in  place  of  /  G  Lr)  such  that 

a  00  (U  \  1  ' 

{i-2aLUk{fJ)T)T y)  <  oo.  (2.77) 

Notice  that  the  B-spaces  and  Besov  spaces  are  normalized  differently  with  respect  to  the 
smoothness  parameter.  Thus,  e.g.,  the  fat  B-space  Mfk(T)  corresponds  to  the  Besov  space 
B/a'k(Lr). 

From  the  properties  of  (jk(f,t)T,  it  readily  follows  that 

ll/ll*^,  «  (  £(22“”Vi(/,2-”*)Tr)1/T.  (2.78) 

meZ 


Next,  we  give  an  equivalent  norm  for  the  Besov  space  B/a'k(Lr )  in  terms  of  local  poly¬ 
nomial  approximation.  We  let  D'm  denote  the  set  of  all  dyadic  squares  I  of  the  form 


I 


'  v  —  1  v  \ 

2  m  ’  2m  ) 


X 


'//  —  1  jl  \ 

9 m  ’  2m ) ' 


v,  H  G  Z, 


and  let  D"  be  the  set  of  all  shifts  of  /  G  D'm  by  the  vector  e  :=  (2  m  1 . 2  m  1 ).  i.e. , 
T>"  :=  {/  +  e  :  /  G  D'm  \.  We  denote  Drn  :=  D'm  U  D'/n  and  D  :=  We  now 

introduce  the  following  norm 

N(/)  :=  (^(|/|-^(/,/)9t)1/T  «  (^(l^r^(/.Ur)T)1/T,  (2.79) 

ieD  ieD 


where  Ek(f,I)r  is  the  error  of  LT(/)-approximation  to  /  from  n^.. 
Lemma  2.21.  If  f  G  B2a'k(LT),  then 


N  (/)  *  ll/ILw-i^ 

with  constants  of  equivalence  depending  only  on  p,  a,  and  k. 

Proof.  This  lemma  is  well  known  and  fairly  easy  to  prove.  Its  proof  hinges  on  the  following- 
equivalence: 

MfJVr  ~  4-  f  I  I Akh(f,x,I)\Tdxdh,  (2.80) 

1  J[0, £(!)}%  Jl 
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where  1(1)  :=  | i")1/2  and  A \(f,x,I)  :=  ^*L0(— l)k+i (^)f(x  +  jh)  if  [,x,x  +  A:h]  C  /  and 
/)  :=  0  otherwise  (see  [PP]  for  the  proof  of  (2.80)  in  the  univariate  case;  the  same 
proof  applies  to  the  multivariate  case  as  well).  (See  also  [T]).  □ 

We  next  consider  B-spaees  over  regular  triangulations  (see  §2.1). 

Theorem  2.22.  If  T*  is  a  regular  triangulation  then  B fk(T*)  =  B^.a'k(LT)  with  equivalent 
norms. 

Proof.  This  theorem  is  an  immediate  consequence  of  Lemma  2.21  and  the  following  lemma. 

□ 


Lemma  2.23.  Suppose  T*  is  a  regular  triangulation  with  minimal  angle  (3  >  0.  Then  there 
exists  '/'o  =  i o(/3)  such  that  the  following  hold: 

(a)  If  I  G  Dm  (m  G  Z),  then  there  exists  A  G  T*  such  that  I  C  Ha  and  |A|  <  2_2m+*°. 

(b)  If  A  G  T*  and  2_2m  <  |A|  <  2_2m+2,  then  there  exists  I  G  such  that  Ha  C  /. 


Proof.  The  proof  of  this  obvious  lemma  will  be  omitted.  □ 

Exactly  as  in  the  case  of  B-spaces,  we  introduce  the  following  norm  in  the  Besov  space 
Bla'k{Lr):. 


ieD 


n,(/)  :=  (£(iWA(/,/)„r)7 « (£(|/Ti£i(/,/)„r) 

ieD 

which  in  integral  form  gives 

N„(/)  « 


(2.81) 


0  .7i82 


t2(lp  lr,)uk{f,Bt{x))r 


t  3  dx  clt 


(2.82) 


where  Bt(x)  :=  {y  G  R2  :  \\y  -  x\\2  <  t}  or  Bt(x)  :=  {ye  R2  :  \\y  -  x^  <  t}. 


Proposition  2.24.  The  norms  N^(-)  with  0  <  rj  <  p  and  ||  •  \\Bi a,k^L  }  are  equivalent. 

Proof.  Using  Lemma  2.23  as  in  the  proof  of  Theorem  2.22,  one  can  show  that  N^(-)  ~ 
N^(-,r*)ifT*  is  a  regular  triangulation.  From  Theorems  2.20  and  2.22,  we  obtain 


Nw,7)(',7_*)  ~  ||  •  ||3Pfc(T*)  ~ 


□ 


Remark.  This  result  is  (in  essence)  well  known,  see  [T]  and  the  references  therein.  The 
equivalence  of  N^(-)  and  ||  •  || ^2,,./,^  ^  clearly  shows  the  intimate  relation  of  B-spaces  with 
Besov  spaces. 

Our  last  goal  in  this  section  is  to  find  the  range  for  the  smoothness  parameter  a,  where 
the  Besov  R2"-spaces  coincide  with  the  corresponding  slim  or  skinny  B-spaces  over  regular 
triangulations. 
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Theorem  2.25.  Suppose  T*  is  a  regular  triangulation  of  M2 ,  0  <  p  <  oo,  and  k  >  1. 

(a)  If  0  <  a  <  1  +  1  /p  and  1/r  :=  a  +  l/p,  then  f  £  Bf(T*)  if  and  only  if  f  £  B/a'2(Lr), 
and 

(2.83) 


I  B2Ta’2(Lr)  ~  I  \J  I \m(T*) 

with  constants  of  equivalence  depending  only  on  p,  a  and  /3  =  f3(T*)-  This  equivalence  is 
no  longer  true  if  a  >  1  +  1/p.  Moreover,  for  every  9  £  @(T*)  and  a  >  1  +  1  /p,  we  have 

\\po\\ B+’2(Lt)  =  00  >  while  I  r»l  /.-(7  m  ~  |Mj>- 

(b)  If  0  <  a  <  1/p  and  1/r  :=  a  +  1/p,  then  f  £  Bf'k(T*)  if  and  only  if  f  £  B/a,k(LT), 
and 

II/IIs+’*(l+  ~  ll/lli+fc(T*)  (2.84) 

tvith  constants  of  equivalence  depending  only  on  k,  p,  a  and  /3  =  f3(T*)-  This  equivalence  is 
no  longer  true  if  a  >  1/p.  Moreover,  for  every  A  £  T*  and  a  >  1/p,  we  have  ||  ^  = 

oo;  while  ||1a||b+(t*)  ~  ||1a||p- 

Proof,  (a)  From  Theorems  2.19  and  2.22,  we  have  H/Hb+t*)  A  cll/|ls2a’2(LT)  f°r  a  >  0.  We 
next  show  that 

V\\b?*M  <  «  ll/llw),  if  o  <  a  <  1  +  1/p.  (2.85) 

Let  f  £  Bf  ( T' ) .  Then  by  Theorems  2.15-2.16,  and  (2.38),  it  follows  that  /  can  be  repre¬ 
sented  in  the  form 

f  =  'S^/bepe  absolutely  a.e.  on  R2  (2.86) 

oe& 

and 

ll/llff?{T*)  ~  (y^  \\bepe\\l)  i  (2-87) 

ee@ 

where  0  :=  @(T*)- 

Denote  3?  :=  {()  £  O  :  2~2j  <  \9\  <  2-2^-1)}.  Since  T*  is  regular,  straightforward 
calculations  show  that,  for  each  6  £  0, 


^2 


|0|(l-r)/2  +l+r,  if  0  <  t  <  |0|1/2, 

\e\,  ift  +  l^l1/2, 


and  hence,  for  9  £  3,  and  t  >  0, 

MboPoBVr  ~  min{||6,^|r  •  2  T'>  1  1  "b/'W  \\bm\\;  •  2~^},  (2.88) 


where  we  used  that  1/r  =  a  +  1/p. 

Denote  fj  :=  hope-  Since  T*  is  regular,  ff{9  £  3.;  :  x  G  9}  <  c(j3)  for  x  £ 

j  £  Z.  Therefore, 


and 


(2.89) 


U2{fj,t)TT<c^2uj2{bepo,t)TT,  j  £  Z. 

oezj 

From  (2.88)-(2.89),  we  derive  that  for  any  fixed  rn  £  Z 

^(/j.  2-"*);  <  e  5]  •  2-”,|1+T)||6#V)»||;,  if.,  <  m,  (2.90) 


9eZj 
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and 


(2.91) 


«»</,.  2~*K  <*E  ^  3  >  m. 

eesj 

Let  A  :=  min{r,  1}.  Then,  using  (2.90)-(2.91),  we  have 


j'GZ 


TO— 1 


A  OO 


<  C 


E  ( E  2-y("-1-',T-’"(1+T)ii6»v»n;)7 +cE  ( E  2~32°tiiw»ii;) 

j=-oo 

Substituting  this  in  (2.78),  we  obtain 


j=m  0e~j 


X  .  21 
A 


B2Ta'2(LT)  - 


<  *e2’”i"-1-'St(e  (2-^i-^eii^«ii;)-3 


toGZ 


J=-°o 


^  .  21 


:  ^  2 -'2~(  e  ii^ii;)  7)  1 


m£Z 


3~m 


eeZi 


where  we  used  that  2ot  —  r  —  1  =  r(o  —  1  —  1/p )  since  1/t  —  a  1/p.  To  estimate  the 
above  sums,  we  use  the  well  known  discrete  Hardy  inequalities.  Namely,  we  apply,  e.g.,  the 
inequality  from  Lemma  3.10  of  [PP]  to  estimate  the  first  sum  and  Lemma  3.4  from  [DL]  to 
the  second  sum.  We  obtain 


?2a,2(T  \  _E 


(Lr) 


<  c 


EE  \\bo<Po\\ 

0  G 


p< 


which  completes  the  proof  of  (2.85). 

Using  (2.88),  we  obtain 

/■°°  fit 

\Wo\\tb^Lt)  =  Jo  (t-2aco2(ve,t)Ty  j 

rW/2  r°° 

«  \e\(1~T)/2  t(~2a+1)T  dt  +  \e\  r2aT~ldt 

J  0  ./  0  '  J-i 

\e\l>2 

t(-2a+1)r  dt  +  \e\r/p. 

Therefore,  \\lfio\\B2a^(LT)  =  00  if  (— 2a  +  l)r  <  —1  which  is  equivalent  to  a  >  1  +  1/p ,  using 
that  1/r  =  a  +  1/p.  It  is  easily  seen  that  ||pe||fl“(r*)  ~  which  follows  from  the 

Bernstein  inequality  in  Theorem  3.7  as  well. 

(b)  Simple  calculations  show  that  u^(1aU)t  ~  min{|A|1//2t,  | A | }  for  A  G  T*  and  t  >  0. 
The  rest  of  the  proof  is  similar  to  the  proof  of  part  (a)  and  will  be  omitted.  □ 


Comparison  between  B-spaces  over  different  triangulations  and  Besov  spaces. 

Suppose  T  is  an  arbitrary  strong  locally  regular  triangulation  of  R2  (§2.1)  and  0  <  p  <  oo. 
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It  can  be  proved  that  there  exists  =  ao(p,  [3,  M0)  >  0  such  that  if  0  <  a  <  «o  and 
/  G  R“(T)  with  1/r  :=  a  +  1/p,  then 

II/IIb?2(t)  <  c||/||B?(r)- 

We  leave  the  proof  of  this  result  for  elsewhere  since  it  is  much  longer  and  more  involved  than 
the  proof  of  Theorem  2.25.  Thus  the  fat  B-norm  ||/||Bp2(r)  is  equivalent  to  the  slim  B-norm 
||/||s“(r)  f°r  some  relatively  small  range  0  <  a  <  a0  and  becomes  much  larger  when  a  >  tt0. 
The  relationship  between  fat  and  skinny  B-spaces  is  quite  similar.  We  skip  the  details. 

It  is  essential  for  our  theory  that  the  Courant  elements  pg,  9  G  0(T),  have  infinite 
smoothness  (smoothness  of  order  a  >  0  for  every  a)  in  the  slim  B-space  scale  R"(T).  At  the 
same  time  each  pg  has  limited  smoothness  a  <  «o  in  the  corresponding  fat  B-space  scale. 

If  one  compares  a  B"-space  over  an  arbitrary  triangulation  with  the  corresponding  Besov 
space  B2a,k(Lr)  (or  two  B-spaces  over  different  triangulations  with  each  other),  then  ev¬ 
erything  changes  dramatically.  As  was  shown  in  §2.1,  there  exist  strong  locally  regular 
triangulations  with  extremely  skinny  Courant  elements  which  cause  problems  to  Besov 
spaces.  More  precisely,  let  pg  be  the  Courant  element  associated  with  a  cell  9  G  0  which 
is  convex,  has  length  l  >  0  and  width  el  with  0  <  e  <  1.  Simple  calculations  show 
that  lo2 {pg,t)TT  ~  min {e~Tl1~Tt1+T  ,el2}.  Furthermore,  we  have  \\Lpg\\B2a,2^L^  ~  e_Q!||+6i||j)  if 
0  <  a  <  1  +  1/p  and  \\pg\\Biajt^L^  =  00  if  a  >  1  +  1  /-p.  At  the  same  time,  ||v?e||B“(r)  ~  ||+o||p 
for  each  a  >  0.  Therefore,  even  for  small  a  the  Besov  norm  of  a  Courant  element  can  be  huge 
in  comparison  to  its  Lp-norm.  This  is  why  the  Besov  spaces  are  completely  unsuitable  for  the 
theory  of  n-term  Courant  element  approximation  in  the  case  of  nonregular  triangulations. 

B-spaces  in  dimensions  d  7^  2.  Slim,  skinny,  and  fat  B-spaces  in  d  dimensions  (d  >  2) 
can  be  defined  and  utilized  similarly  as  in  the  two-dimensional  case.  We  do  not  consider 
them  in  the  present  article  simply  to  avoid  some  complications  that  are  unnecessary  at  this 
point.  Of  course,  the  B-spaces  can  be  defined  in  the  univariate  case  as  well.  However,  it 
can  be  shown  that  the  univariate  slim,  skinny,  and  fat  B-spaces  do  not  give  anything  better 
than  the  corresponding  Besov  spaces  if  0  <  p  <  00  and,  therefore,  are  useless.  The  point 
is  that  in  the  univariate  case  the  Bernstein  inequality  holds  with  no  restrictions  on  a  >  0 
(see  [PI]).  In  the  case  of  p  =  00,  however,  the  B-spaces  are  different  from  the  corresponding 
Besov  spaces. 

3  Nonlinear  piecewise  polynomial  approximation 

In  this  section,  we  give  our  main  results  for  nonlinear  n-term  approximation  in  Lp(R2)  (0  < 
p  <  00)  from:  (a)  Courant  elements  generated  by  LR-triangulations  and  (b)  discontinuous 
piecewise  polynomials  over  WLR-triangulations. 

3.1  Nonlinear  n-term  approximation:  General  principles 

We  begin  with  a  brief  description  of  the  general  principles  that  will  be  guiding  us  in  devel¬ 
oping  the  theory  of  nonlinear  n-term  approximation  by  piecewise  polynomials. 
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Let  X  be  a  normed  or  quasi-normed  function  space,  where  the  approximation  will  take 
place  (in  this  article,  X  =  Lp( M2),  0  <  p  <  oo).  Suppose  <f>  =  {p>o}oee  is  a  collection  of 
elements  in  X  which  is,  in  general,  redundant,  and  we  are  interested  in  nonlinear  n-term 
approximation  from  <J>.  We  let  En  denote  the  nonlinear  set  of  all  function  S  of  the  form 

S  =  ^2  a9(pe, 

oe  An 

where  An  C  0,  #An  <  n,  and  An  varies  with  S.  The  error  of  n-term  approximation  to 
/  £  X  from  <f>  is  defined  by 

°n{f)  ■=  inf  ||/  -  S\\x. 

Our  main  objective  in  this  article  is  to  describe  the  spaces  of  functions  of  given  rates  of 
n-term  approximation.  More  precisely,  we  want  to  characterize  the  approximation  space 
Aq  :=  M^($),  7  >  0,  0  <  q  <  oo,  consisting  of  all  functions  /  £  X  such  that 

ll/Ik  :=  ll/IU-  +  (f >V„(/))«i)1/'  <  oo  (3.1) 

n—  1 

with  the  iq- norm  replaced  by  the  sup-norm  if  q  =  oo.  Thus  is  the  set  of  all  /  £  X  such 
that  crn(/)  <  cn~"< . 

To  achieve  our  goals,  we  shall  use  the  machinery  of  Jackson  and  Bernstein  estimates  plus 
interpolation  spaces.  Suppose  B  C  X  is  a  smoothness  space  with  a  (quasi-)norm  ||  •  ||B, 
satisfying  the  A-triangle  inequality:  \\f  +  g\\g  <  ll/lln  +  II^IIb  with  0  <  A  <  1  (in  our  case, 
B  will  be  some  B-space),  and  let  <S>  C  B.  The  il -functional  is  defined  by 

K(f,  t )  :=  K(f,  t-  X,  B )  :=  inf  (||  /  -  ^||x  +  t\\g\\B%  t  >  0. 

geB 

The  interpolation  space  (X,B)^q  (real  method  of  interpolation)  is  defined  as  the  set  of  all 
/  £  .Y  such  that 


l/<7 

II/II(*,b>„  :=  ll/IU-  +  (  (/,  2-)]«)  <  oo,  0  <  fl  <  1, 

m=0 


where  the  tq- norm  is  replaced  by  the  sup-norm  if  q  =  oo  (see,  e.g.,  [BL,  BS]). 

The  well  known  machinery  of  Jackson  and  Bernstein  estimates  allows  to  characterize  the 
rates  of  n-term  approximation  from  <J>: 


Theorem  3.1.  (a)  Suppose  the  following  Jackson  estimate  holds:  There  is  a  >  0  such  that 
for  f  £  B 

vn{f)  <  cn~a\\f\\B,  n  >  1.  (3.2) 


Then,  for  f  £  A", 


o»(/)  <  cK{f,n  “),  n  >  1. 

(b)  Suppose  the  following  Bernstein  inequality  holds:  There  is  a  >  0  such  that 


(3.3) 


||5||b  <  Bn*||5||x,  for  S  e  n  >  1. 


(3.4) 
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Then,  for  f  G  X , 


/r  ”  1  -|  i/a  x 

A'(/,n-“)<cn-«([^-K£r„(/))AJ  +H/IU-),  ft  >  1 .  (3.5) 

Proof.  For  the  proof  of  this  theorem  see,  e.g.,  [PP].  □ 

An  immediate  consequence  of  Theorem  3.1  is  that  if  the  Jackson  and  Bernstein  inequali¬ 
ties  (3.2)  and  (3.4)  hold,  then  an(f)  =  0(n-7),  0  <  7  <  a,  if  and  only  if  K(f,  n~a )  =  0(n-7). 
More  generally,  Theorem  3.1  readily  yields  the  following  characterization  of  the  approxima¬ 
tion  spaces  A7(<F): 

Theorem  3.2.  Suppose  the  Jackson  and  Bernstein  inequalities  (3.2)  and  (3.4)  from  Theo¬ 
rem  3.1  hold.  Then 

Aq{®)  =  (X,  B)zg,.  0  <  7  <  a,  0  <q<  00, 
with  equivalent  norms. 

General  embedding  theorem  and  Jackson  estimate  for  nonlinear  n-term  approx¬ 
imation. 

Theorem  3.3.  Suppose  { <hm }  is  a  sequence  of  functions  in  Lp(Rd),  d  >  1,  0  <  p  <  00, 
which  satisfies  the  following  additional  properties  when  1  <  p  <  00  : 

(i)  <Fm  G  L^CW1),  supp  <f>TO  C  Em  with  0  <  \Em\  <  00,  and 

H'f’mHoo  A  C\\Em\  ^P||$TO||p. 

(ii)  If  x  G  .  then 

Y.  (|Am|/|A,|)1/p  <  ci, 

Ej3x ,  |Sj|>|Sm| 

where  the  summation  is  over  all  indices  j  for  which  Ej  satisfies  the  indicated  conditions. 
Denote  ( formally )  /  :=  (I>m  an<^  assume  that  for  some  0  <  r  <  p 

w):=(17i<m;;)1/t<c*.  (3.6) 

m 

Then  l$m(  •)|  <00  a. e.  on  M.d ,  and  hence,  f  is  well-defined  on  ,  f  G  Lp(Rd),  and 

||/||p<||X>™(OI  <cN(f)  j  (3,7) 

II  -  P 

m 

where  c  =  c(a,p,  c\). 

Furthermore,  if  1  <  p  <  00,  condition  (3.6)  can  be  replaced  by  the  weaker  condition 

N(f )  :=  ||{||<Mp)IU  <  00,  (3.8) 

where  ||{:rm}||wer  denotes  the  weak  iT-norm  of  the  sequence  {xm}  : 

IIKJIU  :=  inf{M  :  #{m  :  \xm\  >  Mn  }  <  n  for  n  =  1,2,.. (3.9) 
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Theorem  3.4.  Under  the  hypothesis  of  Theorem  3.3,  suppose  is  a  rearrangement 

of  the  sequence  {$m}  such  that  ||$*||p  >  >  . . ..  Denote  Sn  :=  Y^j=i  Then 

11/  —  Sn\\p  <  cn~aN(f )  with  a  =  1/r  —  1/p ,  (3.10) 

where  c  =  1  if  0  <  p  <  1  and  c  =  c(a,  p,  c\)  if  1  <  p  <  oo. 

Furthermore,  the  estimate  remains  valid  if  condition  (3.6)  is  replaced  by  (3.8)  when  1  < 
p  <  oo. 

Proof  of  Theorems  3.3  and  3.4.  Case  T.  0  <  p  <  1.  Since  r  <  p,  we  have 

||£hu-)i||  <  (EibM?)17^  (£ii*-ii;)1/t  =  jv(/) 

m  P  m  m 

which  proves  Theorem  3.3  in  this  case.  To  estimate  ||/  —  5„||p  we  shall  use  the  following- 
simple  inequality:  If  X\  >  %2  >  . . .  >  0  and  0  <  r  <  p,  then 

(3.11) 

j=n+l  j=l 

The  proof  of  this  inequality  is  given  in  the  appendix  (§4.2).  Applying  (3.11)  with  Xj  :=  ||$^||p, 
we  obtain 

oo  oo  i  / 

11/ -S.it  <  I  £  i*;(-)i|  <  (  £  H4«)  ' 

j=n+ 1  j=n+ 1 

OO  / 

<  n1/''-1/T(  Elicit)  =n-“N(f), 

./•-I 

which  proves  Theorem  3.4  in  Case  I. 

Case  IT.  1  <  p  <  oo.  We  need  the  following  lemma: 

Lemma  3.5.  Let  F  :=  Yljejn  l^jl;  where  ffjn  <  n,  and  ||$j||p  <  L  for  j  G  Jn ■  Then 

ll^llp  <  cLn ^ 


with  c  =  c(p,  ci). 

Proof.  Let  1  <  p  <  oo  (the  case  p  =  1  is  trivial).  Using  property  (i)  of  the  sequence  {$TO}, 
we  have  _ 

MiP< |  E -w-)  <cL |  E 1^(->  - 

II  -  p  II  -  p 

jeJn  jeJn 

We  define  E  :=  (J  -eJn  Ej  and  £(x)  :=  min{|i/?  :  j  G  Jn  and  Ej  3  x}  for  x  G  E.  Property 
(ii)  yields  Yljejn  •  1  Ej{x)  A  C\£(x)~l^p  for  x  G  M2.  Therefore, 


l|S%  <  cH\£(.)-Vp\\Lr  =  cl(  Jj(x3  dx)''” 

<  cL  (E  \Ej  I"1  f  lEj  (X)  dx) 1/P  =  cL{ffJnflp  <  cLnl/p.  □ 

•  ST  J  IR2 
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We  define  :=  {.,  :  2 ~»N(f)  <  ||<hjp  <  2^+1iV(/)}.  Then  \JV<„%  =  O'  =  \\$j\\p  > 
2 ~,1N(f)}  and  hence,  using  (3.6)  or  (3.8),  we  derive 

^#3,  =  #(|JS,)  <T\  (3.12) 

v< /j,  u<n 

Therefore, 

#S^<^#3,  <  2'W  (3.13) 

U</1 

We  denote  M  :=  By  (3.12),  M  <  2TOT.  Let  Fp  :=  ^W£3  |$j|.  Using  Lemma  3.5 

and  (3.13),  we  obtain 

OO  OO 

ll/-SM||r  <  £  f„  <  £  ll^ll, 

- 

/x=m+l  /x=m+l 

OO  OO 

<  c  5]  2~'*N(f)(#Eli)1'*  <  cN(f)  2-/4 (1'T/P) 

/i  m  •  1  /i  11/ j  1 

=  cN{f) 2-WWUp)  <  cM-VT+1/PiV(/)  =  cM~aN(f). 

This  estimate  readily  implies  (3.10).  Evidently,  (3.7)  is  also  contained  in  the  above  result 
(take  SM  :=  0).  This  completes  the  proofs  of  Theorems  3.3  and  3.4.  □ 

As  will  be  seen  in  §3.2  and  §3.3,  Theorem  3.4  easily  gives  the  needed  Jackson  estimates 
for  piecewise  polynomial  approximation  (see  Theorems  3.6  and  3.10).  However,  there  is  no 
simple  recipe  for  proving  Bernstein  estimates  (see  §4.1). 

3.2  Nonlinear  n-term  Courant  element  approximation 

In  this  section,  we  assume  that  T  is  a  locally  regular  triangulation  of  M2.  We  denote  by 
$7-  the  collection  of  all  Courant  elements  ipe  generated  by  T  (see  §2.1).  Notice  that  $7-  is 
not  a  basis;  $7-  is  redundant.  We  consider  the  nonlinear  n-term  approximation  in  Lp(R 2) 
(0  <  p  <  00)  from  $7-.  Our  main  goal  is  to  characterize  the  approximation  spaces  generated 
by  this  approximation.  We  let  En(T)  denote  the  nonlinear  set  consisting  of  all  continuous 
piecewise  linear  functions  S  of  the  form 

S  =  y  cioVo, 

6e  An 

where  An  C  @(T),  #An  <  n,  and  An  may  vary  with  S.  We  denote  by  an{f,T)p  the  error  of 
Tp-approxi mati on  to  /  €  Lp(R 2)  from  En(T): 

*n(f,T)p:=  inf  ||/  —  S\\p. 

SE'Sn(T) 

Throughout  this  section,  we  assume  that  0  <  p  <  00,  a  >  0,  and  1/t  :=  a  +  l/p,  and  denote 
by  B®(T)  the  slim  B-space  introduced  in  §2.3.  We  next  prove  a  pair  of  companion  Jackson 
and  Bernstein  estimates. 
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Theorem  3.6.  [Jackson  estimate]  If  f  G  Bf(T),  then 

vn{f,T)p  <  cn-a\\f\\B?(T)  (3.14) 

with  c  depending  only  on  a,  p,  and  the  parameters  of  T ■ 

Remark.  Estimate  (3.14)  remains  valid  if  ||/||_Ba(r)  is  replaced  by  ||{||<W0||p} ll«^T  with  {bo} 
from  (2.32)  or  (2.33)  as  in  the  definition  of  1VqiT(/)  (see  (2.34)),  where  ||  •  \\wiT  is  the  weak 
fT-norm  defined  in  (3.9). 

Proof.  By  Theorem  2.15,  it  follows  that 

/  =  £  bo{f)<Po  absolutely  a.e.  on  M2, 
oeQ 

where  {&<?}  are  from  (2.32)  or  (2.33).  We  use  Theorem  3.4,  (2.38),  and  Theorem  2.16  to 
obtain 

on{f,T)p  <  cn~a(^  ||M/)Pfl||p)  7  ~  cn~aNQ'T{f)  zs  cn““||/||B?(r).  □ 

OeQ 

Theorem  3.7.  [Bernstein  estimate]  If  S  G  £n(T),  then 

II5IIb?(T)  <  cna\\S\\p  (3.15) 

with  c  depending  only  on  a,  p,  and  the  parameters  of  T ■ 

The  proof  of  this  theorem  is  more  involved  than  the  one  of  Theorem  3.6.  We  shall  give 
it  in  the  appendix  (§4.1). 

We  denote  by  AJ  :=  A^(Lp,T)  the  approximation  space  generated  by  n-term  Courant 
element  approximation  (see  (3.1)).  The  Jackson  and  Bernstein  estimates  from  Theorems  3.6 
and  3.7  yield  the  following  characterization  of  the  approximation  spaces  A^(Lp,T)  (see 
Theorem  3.2): 

Theorem  3.8.  7/0  <  7  <  a  and  0  <  q  <  00,  then 

A}(Lp,T)  =  (Lp,Bf(T))hq 

with  equivalent  norms. 

“Algorithm”  for  nonlinear  n-term  Courant  element  approximation.  One  of  our 

primary  motivations  for  this  work  was  the  development  of  methods  for  n-term  Courant 
element  approximation  which  capture  the  rates  of  the  best  approximation.  The  proofs  of 
Theorems  3.3  and  3.6  suggest  the  following  approximation  scheme,  where  we  assume  that 
/  e  Z/P(M2),  1  <  p  <  00,  and  T  is  a  fixed  LR-triangulation  of  R2: 

Step  1.  We  use  the  operators  qm(f )  :=  qm{fi  T)  induced  by  the  quasi-interpolant  (see 
(2.31))  to  find  the  following  decomposition  of  /: 

/  =  £  QmU)  =  £  £  b0{f)<p0, 

me%  mez  oeQm 
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where  {&<?(/)}  are  defined  by  (2.32)  and  the  identity  was  established  by  Theorem  2.15. 

Step  2.  We  order  the  terms  {bo{f)‘fo}eee  in  a  sequence  {bgj  {f)tpoj  }°fi1  such  that 

IMM  >  \K(fMP  >■■■■ 

Then  we  define  the  n-term  approximant  by 

n 

An(f)P  =  An(f,T)p:=J2b0j(f)V0j. 

j= i 

This  procedure  becomes  practically  feasible  in  the  setting  of  approximation  of  functions 
defined  on  compact  polygonal  domains. 

By  Theorem  3.4,  it  follows  that 

\\f  -  An{f)p\\P  <  cn-a\\f\\Ba(T). 

If  0  <  p  <  1,  we  use  the  more  complicated  nonlinear  operators  (v  <  p)  from  (2.31) 

instead  of  qm{f )  and  the  coefficients  bo{f )  :=  bo defined  in  (2.33).  The  same  estimate  for 
the  error  holds  again  by  Theorem  3.4. 

These  results  imply  that  the  above  algorithm  achieves  the  rates  of  the  best  n-term 
Courant  element  approximation.  We  shall  further  elaborate  on  this  in  a  forthcoming  ar¬ 
ticle. 

n-term  approximation  from  the  library  We  denote  by  dn(/)p  the  error  of 

n-term  approximation  to  /  G  Lp( R2)  from  the  best  Courant  element  collection,  i.e., 

d«.(/)p  •  inf  dn(/,  T)p, 

where  the  infimum  is  taken  over  all  LR-triangulations  T  with  some  fixed  parameters  M0, 
Ay.  r,  p ,  and  S.  The  following  result  is  immediate  from  Theorem  3.6. 

Theorem  3.9.  Suppose  inf^  ||/||n“(r)  <  where  the  infimum  is  taken  over  all  LR- 
triangulations  with  some  fixed  parameters  M0,  N0,  r,  p,  and  8,  and  let  f  G  LP(R2).  Then 

Sn{f)p  <  cn“a  inf  || /llmcr), 

where  c  depends  on  a,  p,  and  the  parameters  M0,  N0 ,  r,  p,  5. 

It  is  an  open  problem  to  characterize  the  rates  of  approximation  generated  by  {a n(f)p}- 
The  difficulty  stems  from  the  highly  nonlinear  structure  of  approximation  from  the  library 
{$r}r- 

3.3  Nonlinear  approximation  from  (discontinuous) 
piecewise  polynomials 

In  this  section,  we  assume  that  T  is  a  weak  locally  regular  triangulation  of  M2  (§2.1).  We 
denote  by  S^(T)  ,  k  >  1,  the  nonlinear  set  of  all  n-term  piecewise  polynomial  function  of  the 
form 

5  =  1A  •  PA, 

aga„ 
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where  Pa  €  1R,  An  C  T,  #An  <  n,  and  An  may  vary  with  S.  We  denote  by  crn(f,T)p  the 
error  of  Lp-approxi rnati on  to  /  G  Lp(R 2)  from  E*(T): 

We  want  to  characterize  the  approximation  spaces  generated  by  crn(f,T)p-  To  this  end  we 
shall  proceed  according  to  the  recipe  from  §3.1.  We  shall  first  prove  Jackson  and  Bernstein 
estimates.  Throughout  the  rest  of  the  section,  we  assume  that  0  <  p  <  oo,  A:  >  1,  a  >  0, 
and  1/r  =  a  +  1/p.  Recall  that  Bf'k(T)  denotes  for  the  skinny  B-space  introduced  in  §2.4. 

Theorem  3.10.  [Jackson  estimate]  If  f  G  Bfk{T),  then 

On{f,T)p  <  cn~a\\f\\Bak(T) 

with  c  depending  only  on  p,  a,  k,  and  the  parameters  of  T ■ 

Remark.  The  conclusion  of  Theorem  3.10  remains  valid  if  ||/||b“*(t)  is  replaced  by  the  weak 
4-norm  ||  {pa,j?(/)}agtIU^  of  the  sequence  {/t\.,,(/)  }ac  7  •  0  <  7]  <  p,  defined  in  (2.61)  (see 
also  (3.9)  for  the  definition  of  ||  •  \\wiT). 

Proof.  By  Theorem  2.17,  we  have  /  =  J2AeTpA  absolutely  a.e.  on  R2  and  ||/||b«*(t)  ~ 
(Sa er  \\'Pa\\tp)1/t,  where  pA  ■=  PA,ri(f)  (0  <  rj  <  p)  are  from  (2.61).  Evidently,  the  sequence 
{Tj}  :=  {pa}a£T  satisfies  the  requirements  of  Theorem  3.3  and,  therefore, 

^n(/,T)p  <  cn““(  ^  ||pa||p)  7  <  cn-a\\f\\B«k(T).  □ 

AeT 

Theorem  3.11.  [Bernstein  estimate]  If  S  G  Hk(T),  then 

I|S||b?*(T)  <  cn“||S||p  (3.16) 

with  c  depending  only  on  p,  a,  k,  and  the  parameters  of  T ■ 

We  shall  give  the  proof  of  this  theorem  together  with  the  proof  of  Theorem  3.7  in  the 
appendix  (§4.1). 

Now,  we  denote  by  A 1  :=  A^(Lp,T)  the  approximation  space  generated  by  {crn(/,  T)p} 
(see  (3.1)).  The  following  characterization  of  the  approximation  spaces  AJ  follows  by  Theo¬ 
rems  3.10  and  3.11  (see  Theorems  3.1  and  3.2): 

Theorem  3.12.  If  0  <  7  <  a  and  0  <  q  <  oo,  then 

A’q{Lp,T)  =  {Lp,B?{T))u 

with  equivalent  norms. 

Similarly  as  in  the  previous  section,  we  set 

crn(/)p  :=  i^f  <Jn(f,  T)p, 

where  the  infimum  is  taken  over  all  WLR-triangulations  T  with  some  fixed  parameters  r 
and  p.  The  following  result  is  immediate  from  Theorem  3.10. 
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Theorem  3.13.  Suppose  inf-/-  H/Hb^ct)  <  °°>  where  the  infimum  is  taken  over  all  WLR- 
triangulations  with  some  fixed  parameters  r  and  p,  and  let  f  G  LP(R2).  Then 

°n{f)P  <  <■"  "  inf  WIWb-Ht)- 

It  is  an  open  problem  to  characterize  the  rates  of  approximation  generated  by  {a n(f)p}. 

“Algorithm”  for  nonlinear  n-term  piecewise  polynomial  approximation.  We  as¬ 
sume  that  /  G  Lp(R2),  0  <  p  <  oq,  and  T  is  an  arbitrary  WLR-triangulation  of  M2.  The 
proofs  of  Theorems  3.3  and  3.10  suggest  the  following  approximation  scheme: 

Step  1.  We  use  the  local  polynomial  approximation  to  obtain  the  following  decomposition 
of/: 

/  =  P™M )  = 

me%  A eT 

where  pA,p{f)  =  1a  ■  Pm,fif)  if  A  €  Tm,  and  r/  <  p  (see  Theorem  2.17). 

Step  2.  We  order  the  terms  {pa,j?(/)}agt  in  a  sequence  {paj4//)}/Li  such  that 

\\PAuV{f)\\P  >  \\pa2M)\\p  >■■■■ 

Then  we  define  the  n-term  approximant  by 

n 

An{f)p  =  An(f,  T)p  :  ,,(/)• 

r  - 1 

By  Theorem  3.10  and  its  proof,  it  follows  that,  for  /  G  B^k(T), 

II/-  An{f)p\\p  <  cn-“||/||B?*(r). 

Haar  bases  generated  by  general  triangulations.  An  important  point  in  this  article 
is  that  we  carry  out  here  nonlinear  n-term  approximation  without  using  bases.  In  the 
exceptional  case  of  nonlinear  approximation  from  piecewise  constants,  however,  Haar  bases 
can  be  constructed  and  utilized  for  nonlinear  n-term  approximation  in  Lp,  1  <  p  <  oo. 
To  make  it  simple,  suppose  that  T  is  a  weak  locally  regular  triangulation  of  R2  which  is 
obtained  by  the  standard  refinement  scheme  described  in  §2.1:  Every  triangle  A  G  T  has 
four  children  obtained  by  choosing  a  point  on  each  edge  of  A  and  joining  these  points  by  line 
segments.  Denote  by  Ai, . . . ,  A4  the  children  of  A  so  that  A4  is  the  triangle  in  the  middle 
(with  its  vertices  on  the  three  edges  of  A).  We  associate  with  A  the  following  three  Haar 
functions:  HA, i  :  =  |Ai|  11a1  —  |A  \  A4|  11a\a,,  Ha,2  | A2 1  11a2  —  IA3  U  A4|  11a3ua4i 
and  Ha, 3  :=  |A3|_1]lAg  —  |A4|_11A4.  The  way  we  order  the  children  of  A  is  not  important. 
Clearly,  1A,  //AJ ,  HA, 2,  and  HA, 3  form  an  orthogonal  system  which  spans  the  set  of  all 
piecewise  constants  over  { A ;  j ;  , .  Then 

Hr  ■=  {Ha,i,  Ha,2,  Ha,3}aet 

is  a  Haar  basis  associated  with  T ■  It  is  easily  seen  that  Hr  is  an  orthogonal  basis  in  L2(R2)- 
It  can  be  proved  by  a  standard  technique  that  Hr  is  an  unconditional  basis  for  Lp(R2), 
1  <  p  <  00,  and  that  Hr  characterizes  the  skinny  norm,  a  >  0,  1/r  =  a  +  1/p. 

As  a  consequence,  the  nonlinear  n-term  Lp-approximation  from  7i.j  can  be  characterized  as 
above  (compare  with  [P2] ) .  We  skip  the  details  of  these  results. 
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3.4  Conclusions  and  open  problems 

We  bring  forward  again  the  fundamental  question  of  how  to  measure  the  smoothness  of  the 
functions.  There  is  a  close  connection  between  sparsity  of  representation  and  smoothness 
of  functions  that  we  also  wish  to  discuss  here.  As  we  mentioned  in  the  Introduction,  we 
believe  that  in  highly  nonlinear  approximation  as  well  as  in  some  other  nonlinear  problems 
the  smoothness  of  the  functions  should  not  be  measured  using  a  single  space  scale  (like 
Besov  spaces)  but  by  a  family  (library)  of  suitable  space  scales.  To  explain  this  concept 
more  precisely  we  return  to  n-term  Courant  element  approximation  considered  in  §3.2.  For 
this  type  of  approximation,  a  function  /  should  naturally  be  considered  of  smoothness  order 
a  >  0  if  infr  ||/||na(r)  <  oo,  which  means  that  there  exists  an  LR-triangulation  7/  such  that 
||/||b“(7})  <  oo.  Then  the  rate  of  the  n-term  Lp  -  ap  p  r  ox  i  m  at  ion  of  /  from  the  library  {$7-}  is 
at  least  0(n~a).  It  is  an  open  problem  to  develop  effective  procedures  that:  (a)  determine  (or 
estimate)  the  maximal  smoothness  a  of  a  given  function  /  and  (b)  for  a  given  function  /,  find 
an  LR-triangulation  7}  such  that  ||/||n“(Tf)  ~  infr  ||/||,b«(t)-  Another  related  open  problem 
is  to  determine  whether  for  each  function  /  G  Lp  there  exists  a  single  LR-triangulation  7/ 
such  that  the  n-term  Lp  -  ap  p  r  ox  i  m  at  ion  of  /  from  the  library  {Tt}  can  be  characterized 
using  the  B-spaces  R“(7/). 

An  important  issue  for  discussion  is  the  smoothness  of  the  approximating  tool  $7-  :  = 
{LPo}eeG(T)-  Clearly,  in  nonlinear  approximation,  there  is  no  saturation,  which  means  that 
the  corresponding  approximation  spaces  A 1  are  nontrivial  for  all  0  <  7  <  00.  Therefore, 
the  smoothness  spaces  to  be  used  should  naturally  be  designed  so  that  the  basis  functions 
{pe}  are  infinitely  smooth.  This  was  one  of  the  guiding  principles  to  us  in  constructing  the 
B-spaces.  For  instance,  the  Courant  elements  {po}oeQ(T)  are  infinitely  smooth  with  respect 
to  the  B®(T)  space  scale,  namely,  1 1  1 1  s«(r)  A  cllv^llp  for  0  <  a  <  00  (see  §2.3).  This  makes 

it  possible  that  our  direct,  inverse,  and  characterization  theorems  impose  no  restrictions  on 
the  rate  of  approximation  0  <  a  <  00  (see  §3. 2-3. 3).  Also,  this  explains  the  complete  success 
of  Besov  spaces  in  the  univariate  nonlinear  piecewise  polynomial  (spline)  approximation 
in  Lp  ( p  <  00).  The  important  fact  is  that,  any  univariate  piecewise  polynomial  (with 
finitely  many  pieces)  is  infinitely  smooth  with  respect  to  the  corresponding  Besov  spaces. 
More  precisely,  for  univariate  discontinuous  piecewise  polynomials,  the  Bernstein  inequality 
holds  without  any  restriction  on  the  smoothness  parameter  a(0<a:<oo)ifp<oo  (see 
Theorem  2.2  from  [PI]).  In  dimensions  el  >  1,  however,  the  situation  is  totally  different. 
Even  for  nonlinear  approximation  from  regular  piecewise  polynomials  (piecewise  polynomials 
generated  by  regular  triangulations,  in  our  terms),  the  Besov  spaces  are  not  exactly  the  right 
smoothness  spaces.  Namely,  the  Besov  spaces  coincide  with  the  right  smoothness  spaces 
only  for  some  range  of  the  smoothness  parameter  a.  For  instance,  for  nonlinear  n-term  Lp- 
approximation  from  Courant  elements  generated  by  a  regular  triangulation  of  R2 ,  the  Besov 
spaces  R2“,2(Lr),  1/r  :=  a  +  1/p,  0  <  p  <  00,  are  the  right  spaces  only  for  0  <  a  <  1  +  1/p. 
In  the  case  of  discontinuous  piecewise  polynomial  approximation,  the  range  is  0  <  a  <  1/p 
(see  §2.5).  For  the  same  reason,  the  fat  B-spaces  (§2.5)  are  not  exactly  the  right  spaces  for 
characterization  of  n-term  Courant  element  approximation  over  general  triangulations. 

In  nonlinear  n-term  approximation,  it  is  natural  to  work  with  bases.  Except  for  the 
simplest  case  of  n-term  piecewise  constant  approximation  (see  the  end  of  §3.3),  we  are  not 
aware  of  good  (unconditional)  bases  for  Lp( R2)  (1  <  p  <  00)  and  the  B-spaces  over  general 
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triangulations.  However,  as  was  shown  in  the  previous  sections  there  are  equally  powerful 
means  to  tackle  the  problems.  Namely,  using  simple  projectors  into  subspaces  of  piecewise 
polynomials,  one  can  get  sufficiently  spars  representations  of  the  functions,  which  allow  to 
capture  the  rates  of  the  best  nonlinear  n-term  spline  approximation.  It  is  an  open  problem 
to  construct  good  bases  consisting  of  continuous  or  smooth  compactly  supported  piecewise 
polynomials  (or  other  functions)  over  general  triangulations. 

Methods  and  algorithms  for  piecewise  polynomial  approximation  are  in  demand.  This 
was  one  of  the  primary  motivations  for  this  work. 

4  Appendix 

4.1  Appendix  1:  Proof  of  the  Bernstein  estimates 

In  this  subsection,  we  prove  Theorems  3.7  and  3.11.  We  recall  our  assumptions:  0  <  p  <  oo, 
a  >  0,  and  r  :=  (a  +  1/p)-1. 

Tree  structure  in  7~  generated  by  A  C  T.  Suppose  T  is  a  multilevel  triangulation 
(WLR  or  better),  and  let  A  C  T  and  #A  <  oo.  The  set  A  induces  a  tree  structure  in  T  that 
we  want  to  bring  forward  here  and  utilize  in  the  proof  later  on.  We  shall  use  the  parent-child 
relation  in  T  induced  by  the  inclusion  relation:  Each  triangle  A  G  Tm  has  (contains)  <  M0 
children  in  Tm+i  and  has  a  single  parent  in  Tm-i- 

Let  T0  be  the  set  of  all  A  G  7~  such  that  Ad  A'  for  some  A'  G  A.  We  denote  by  Fb  the 
set  of  all  branching  triangles  in  T0  (triangles  with  more  than  one  child  in  T0)  and  by  Tr6  the 
set  of  all  children  in  T  of  branching  triangles  (each  of  them  may  or  may  not  belong  to  F0). 
Now,  we  extend  T0  to  T  :=  T0  U  F7ft.  We  also  extend  A  to  A  :=  A  U  F6  U  F7ft.  In  addition, 
we  introduce  the  following  subsets  of  F:  Vf  the  set  of  all  final  triangles  in  F  (triangles  in 
F  containing  no  other  triangles  in  F)  and  Fcb  :  =  F  \  A  the  set  of  all  chain  triangles.  Note 
that  each  triangle  A  £  Fcb  has  exactly  one  child  in  F.  Since  the  final  triangles  in  F0  belong 
to  A,  then  ffFb  <  fiA  and  hence  ffT'b  <  M0ffFb  <  c#  A,  ffiF  f  <  ff  A  +  ffF'b  <  off  A,  and 
#A  <  ff A  +  ffiFb  +  ffF'b  <  c# A.  Note  that  ffFch  can  be  uncontrolably  larger  than  ff A. 

We  next  introduce  chains  in  Fc^.  By  definition  A  =  {Ax,  . . . ,  A/  }  C  Fch  [l  >  1)  is  a  finite 
chain  in  Fc/j  if  A"  DAiD--oA|D  Aa  for  some  Aa,  A"  £  A,  Ai  is  a  child  of  A",  A j  is  a 
child  of  Aj_i,  j  =  2, . . . ,  £,  and  Aa  is  a  child  of  A<>.  Notice  that  A"  ^  F/,  and  hence  Ai  is  the 
only  child  of  A"  in  T.  We  let  C  denote  the  set  of  all  finite  chains  in  Tch.  Also,  by  definition 
A  =  {. . . ,  A_2,  A_x}  C  Fch  is  an  infinite  chain  in  Tch  if  we  have  •  •  •  D  A_2  D  A_x  D  Aa 
for  some  Aa  £  A,  A  j  is  a  child  of  A  j_i,  j  =  —1,  —2, . . .,  and  Aa  is  a  child  of  A_x.  We  let 
C°°  denote  the  set  of  all  infinite  chains  in  Fc/j.  Clearly,  £  U  C°°  consists  of  disjoint  chains  of 
triangles,  Tch  =  IJag£u£^  and  #(^  U  £°°)  <  # A. 

Finally,  we  use  the  above  sets  to  introduce  rings  generated  by  A.  First,  for  each  A  £ 
T  \  (r6  U  F/),  we  denote  by  A  (A  /  A)  the  unique  largest  triangle  from  A  contained  in  A. 
We  associate  with  each  A  £  T  \  (Fft  U  T f)  a  ring  AA  defined  by  AA  :=  A  \  A.  Also,  we 
define  AA  :=  A  if  A  £  F f  and  AA  :=  0  if  A  £  Tj  U  (T  \  T).  Notice  that  if  A  £  X  for  some 
A  G  C  U  £°°,  then  A  =  Aa.  It  is  readily  seen  that  Kf,  n  Kf,,  =  0  if  A7,  A"  G  A  and  A7  A77, 

A  =  |J  AA',  for  A  G  A,  (4.1) 

A'EA,  A'CA 
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and  hence 


(4.2) 


U  A  =  U  K*' 

AeA  A'eA 

For  the  proof  of  both  theorems,  we  need  the  following  lemma. 

Lemma  4.1.  Suppose  S  =  J^aga  1a  •  Pa,  where  Pa  G  n*.  ( k  >  1),  A  C  T  with  T  a 
WLR-triangulation,  and  <  oo.  Then 

(  e  |Ar"iiisirMA,)1/T  <  c(#A)”nsiiP 

aga 

with  c  depending  only  on  p,  a,  and  the  parameters  of  T ■ 

Proof.  We  adopt  all  necessary  notation  from  “Tree  structure  in  T  generated  by  A  C  T” 
developed  above  with  T  and  A  from  the  hypotheses  of  the  lemma.  We  may  assume  that 

S=^2  Ia-Pa- 

aga 

It  is  an  important  observation  that  S  is  a  polynomial  of  degree  <  k  on  each  ring  A  A  =  A\  A. 
Hence,  using  Lemma  2.7, 

l|S|k(*4)  «  «  lArllSIl^,.  (4.3) 

We  shall  also  need  the  obvious  estimate  (see  (2.1)): 

(lA,|/lAD7  <  C(P,7)  <  oo,  7  >  0-  (4-4) 

Aer,  ad  A' 

We  use  (4.1)-(4.4)  to  obtain 

E  msiiL.A,  =  E  iArr  E  iisiil*,,> 

AeA  AeA  A'eA.A'cA 

=  E  iisiiLt^,)  E  iAi_“r 

A'eA  AeA,  ad  A' 

<  E  iisnwi.,iA'i““T  E  (iA'i/lAl)“T 

A'eA  Aer,  ad  A' 

<  »  E  2  c(  E  I«IW>),Va>,-'/'  £  «(#a)"|s|;, 

A'eA  A'eA 

where  we  once  switched  the  order  of  summation  and  applied  Holder’s  inequality.  □ 

Proof  of  Theorem  3.7.  Let  S  G  E n{T)  with  T  an  LR-triangulation  and  suppose  that 
S  =:  YhoeM  cdTe,  where  A4  C  0(T)  and  #A4  <  n.  Let  A  be  the  set  of  all  triangles  A  G  T 

which  are  involved  in  all  9  G  A4.  Then  S  =  X^AeA^A,  where  Sa  ='■  1a  ■  Pa,  Pa  G  n2. 

Evidently,  #A  <  A0#A4  <  cn.  For  the  rest  of  the  proof,  we  adopt  all  the  notation  from 
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“Tree  structure  in  T  generated  by  A  C  T”,  given  in  the  beginning  of  this  section,  with  T 
and  A  from  the  above.  In  addition,  we  denote 


X*n  :=  {A  G  Tm  :  A  C  Qa'  for  some  A'  6  A  D  Tm }, 

X**  :=  {A  G  Tm  '■  A  C  VL2a,  for  some  Ar  G  A  n  Tm},  where,  for  A  G  Tm, 

nA  :=  (J{Ar  G  Tm  :  A'  n  A  /  0}  and  Q‘i  :=  |J{A'  G  Tm  :  A'  n  0A  ±  0}- 

Also,  we  denote  X*  :=  \JmeZ  X£  and  X**  :=  U meZ^m-  Evidently,  we  have  < 

3iV0#A  <  an  and  #X**  <  3Nq#A  <  an. 

For  rn  G  Z,  we  denote  Sm  :=  level (0)<m  We-  Clearly,  Sm  G  <Sm  and,  therefore,  for 

A  ^  Tn , 

§a(S)t  =  Sa(5  -  STO)T  <  ||5  -  STO||MnA).  (4.5) 

We  shall  also  use  the  obvious  inequality  Sa(<5)t  <  ||5'||lt(qa). 

Next,  we  estimate  ||<S'||^a^  :=  YIabt  I A|_“tSa(<S')^  by  splitting  up  T  into  two  subsets, 
namely,  X*  and  T\X*. 

(i)  If  A  G  X*n .  then  A  C  for  some  A'  G  A  n  Tm  and  hence  QA  C  0^.  From  this,  we 

find  _  __ 

Si(S);  <  ||s||y(0il  =  E  HsHma.)  <  E  IIsHma-) 

A*€Tm,A*cnA  A*6T„„A*cn|, 

and  hence,  using  (2.2), 

|A|-"rsa(sy  <  c  E  lA*r“lsIILcA*)- 

A*erm,A*cn^ 

Therefore, 

E  |A|-“Tsa(S);  <  c  E  IaI““tII5|IL(a) 

Ac  a;i,.  Aex- 

and,  summing  over  m  G  Z,  we  find 

E  ia|— sa(si;  <  c  e  iAr'iisiiL(a) 

Aex*  Aex** 

<  e(^x*TT\\s\\;<cn^\\s\\;,  (4.6) 

where  we  applied  Lemma  4.1  to  S  with  A  replaced  by  X**  which  is  legitimate  since  X**  D  A 
and  hence  S  has  the  required  representation. 

(ii)  Let  A  G  Tm,  \  T*n.  Then  0A  =:  U"=i  for  some  A?  e  (rc/j  n  Tm)  U  (Tm  \  T), 
j  =  1, . . . ,  nA,  with  riA  <  3A^.  We  have,  using  (4.5), 

riA 

Sa (S);  =  sa(s  -  sm%  <  E  IIs  -  S.IIW  (4-7) 

3= 1 

Note  that  if  A  j  G  Tm  \  T,  then  S|A.  =  SmUj  and  hence  ||S  -  S,„|  =  0. 
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Suppose  A j  G  rch  n  Tm-  It  is  an  important  observation  that,  in  this  case,  S\ka,  = 
Sm | k&.  =  1  ka.  ■  Pa j  and  Sm\Aj  =  1A.  ■  I\  .  for  some  PAj  G  II2,  where  KA.  :  A,  \  A, 
(A j  G  A)  is  the  ring  associated  with  Ar  Using  this,  we  find 


'MAj) 
iot— 1 


+  CHPAj  lAk.) 


ii5-smirMAj)  =  ns-sm[rMAj)<c||5|| 

<  c||5||^tAj)+c|Ai||Air-1||5||I,(jrAj) 

For  the  last  inequality  in  (4.8)  we  used  that 


(4.8) 


A 


U'MAj)  - 


<  I  A,- 1  IIP, 


Aj  llLoo(Aj)  a 


<  c|  A  ;  |  IIP 


<  cIAJIAJ-^IIPa 


Aj  1 1  Loo  (Ka.  ) 

<  e|A;||A 


(4.9) 


I  OLT  —  1 


l|5|| 


Lp(KAp 


PH L„(KAj)  A 

where  we  applied  Lemma  2.7  and  used  that  S\ka.  =  Pa,  k-a  .  •  From  (4.7)-(4.8),  we  infer 

|A| 

UAru,ii^nL(A)  + 

a  eT\x*  meZAerchnTm 


E  gi-"tSa(s);  <  cE  E  (iA|-”isirtr(A,  +  A}iisiiip(fe)) 


A| 


<  c  E  iArQTii5iii(A)+c  E  ^ii5n 


Aercfe 

=:  Si  +  E2. 


Aery 


lp(ka ) 


Switching  the  order  of  summation  and  applying  (4.4),  we  obtain 

El  =  *EMIm*,  E  lAr°T 

A'eA  AGrcfe,ADA' 

<  c  E  iisn L(A-)iA'r"  E  <ia'I/iai>‘ 


(4.10) 


A'eA  Aer.ADA' 

<  <=  E  iAr“isirMA,)  <  c(#A)“tiisii;, 

A'eA 


where  for  the  latter  estimate  we  applied  Lemma  4.1  to  S'  with  A  in  place  of  A. 

To  estimate  E2,  we  shall  use  the  representation  of  Tch  as  a  disjoint  union  of  chains:  Tch  = 
UAe£u£°°  Let  ^  ^  £  and  suppose  A  =  {Ai, . . . ,  A*},  where  A"  D  Aj  D  •••  D  A<  D  Aa 
with  Aa,  A"  G  A  (A"  £  r6).  Then 


^|A||A|-'||S|rMXA) 

AeA 


<  IISII^EI^IAA1 

3= 1 


3= 1 


If  A  G  C°°  and  A  G  A,  then  S|/AA  =  0  and  hence  ||S||lp(a'a)  =  0. 
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Summing  the  above  inequalities  over  all  A  G  £,  we  obtain 

^ *c#A)“-||s|i;,  (4.ii) 

A*eA  a*ga 

where  we  used  Holder’s  inequality  and  (4.2).  Estimates  (4.10)-(4.11)  yield 

5]  |a|— Siisj;  <  c(#a)“t||s|i;  <  cn"||s||;. 

a  eT\x* 

This  and  (4.6)  imply  [|<S'||]?Q(-r-)  <  cn“T||S,||p.  □ 

Proof  of  Theorem  3.11.  Let  T  be  a  WLR-triangulation  and  S  G  E *(T).  Then  S  can  be 
written  in  the  form  S  =  XIaga  1a  •  Pa,  where  Pa  G  n*.,  A  C  T,  and  #A  <  n.  As  in  the 
previous  proof,  we  adopt  all  the  notation  from  “Tree  structure  in  T  generated  by  A  C  T" 
with  T  and  A  from  the  above. 

To  estimate  ||S,||^Q(-r-)  :=  XIagt  1^1  ~aTuJk(S,  A)l,  we  shall  split  T  into  three  subsets: 

(i)  If  A  G  T  \  T,  then  S  is  a  polynomial  of  degree  <  A:  on  A  and  hence  Uk(S,  A)T  =  0. 

(ii)  If  A  G  A,  then  evidently  LOk(S,  A)r  <  c||S'||Lt(a)  and  hence 

E  |A|-“To.t(S,  A);  <  cE  |A|-“T||StlL(i)  <  c(#A)“T||S||»,  (4.12) 

AeA  AgA 

wliere  for  the  last  inequality  we  used  Lemma  4.1  (with  A  replaced  by  A). 

(iii)  Let  A  G  Tc/,  (recall  that  Tch  :=  F  \  A).  Clearly,  S \Ka  =  1Ka  ■  Pa  for  some  Pa  G  Hfe-, 
where  Ka  :=  A  \  A  is  the  ring  associated  with  A.  Therefore, 

Ms,  a);  =  ojk(s  -  pa,a)tt  <  c||S||y(i)  +  c\\p^\\ii[k] 

<  c||S|rtr(A,+c|A||A|— 'IlSIli^,,  (4.13) 

where  we  used  that  II^aIIGa,  <c|A||A|»—  ||/>a|IG*ai  which  follows  by  Lemma  2.7  exactly 
as  in  (4.9).  From  (4.13),  we  infer 

E  |A|-«^t(S,A);  <  cE  |A|““Tl|S|(t(i,  +  eE  lAIIAI-'IISIIl,,^, 

Aercfe  AGrc/!  AGrc/i 

=:  E*  +  E* 

We  estimate  E*  and  E£  exactly  as  the  sums  Ex  and  S2  were  estimated  in  (4.10)  and  (4.11), 
respectively.  We  obtain 

E  |A|-“**(S,  A);  <  C(#A)-||S||;  <  cn”||S||;. 

AGrcfe 

Combining  this  estimate  with  (4.12),  we  find  ||5’||gQ(7-)  <  cn"T||S,||p  and  the  proof  of  Theo¬ 
rem  3.11  is  complete.  □ 
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4.2  Appendix  2 

Proof  of  Lemma  2.6  [Whitney].  Suppose  V  C  R2  is  a  parallelogram  and  /  G  Lq(V). 
Evidently,  there  exists  an  affine  transform  A  which  maps  V  one-to-one  onto  [0,  l]2.  Whitney’s 
estimate 

Ek{f,V)q  <  cujk{f,V)q  (4.14) 

is  invariant  under  affine  transforms  and,  hence,  follows  from  the  case  V  :=  [0,  l]2.  For  the 
proof  of  Whitney’s  inequality  on  [0,  l]2,  we  refer  the  reader  to  [B]  (for  the  case  of  q  >  1)  and 
[SO]  (for  the  case  of  0  <  q  <  1). 

Now,  having  (4.14),  we  can  prove  Whitney’s  estimate  for  a  triangle  as  well.  Fix  an 
arbitrary  triangle  A  =  [aq,  :r2,  aq].  Let  yi  :=  ( Xz  +  a.’3)/2,  y2  :=  (aq  +  aq)/2,  and  ys  :  = 
(aq  +  a;2)/2  be  the  midpoints  of  its  edges,  and  let  A'  :=  [2/1,  j/2, 2/s] -  Consider  now  the  three 
parallelograms  Vi  :=  [aq,  y3,  ?q,  y2],  Vz  :=  [x2,yi,y2,y3],  and  V>  :=  [^3,  y2, 2/3,  yi]-  Clearly, 
A  =  |J:j=  |  V:i  and  A'  =  f]j=|  Vr  We  select  polynomials  P&  ,  P| ,  P2 ,  P3  G  11/.  such  that 
11/  -  iVlk(A')  =  Ek ( / 1  A'),  and  \\f  -  Pj\\Lq(v3)  =  Ek{f ,  Vj)q  for  j  =  1,  2,  3.  Evidently,  since 
A'  C  Vj  and  | Vj\  =  2|A'|,  using  Lemma  2.7  and  (4.14),  we  have 

l\Pj  -  Pa'Wl^Tj)  <  A  Ej  <  c\\f  -  Pj\\Lq(A')  +  c\\f  ~  PA'\\Lq(A') 

<  c\\f  ~  Pj\\Lq(Vj)  +  cEk(f,  A’)q  <  cEk(f\Vj)q 

<  CU}k(f,Vj)q<CUk(f,  A)g 

with  c  =  c(q ,  k).  From  this,  we  obtain 

3 

<  ||/  -  Pa-lk(A)  <  11/  -  Pi'lli,(P,) 

j=  1 

3  3 

-  CX/ 11^  “  ^yl  Os  AS  +  CX/  “  ^illo(P)  -  CUJk{.f \  A),, 

./  1  >  1 

where  we  again  used  (4.14).  Thus  (2.11)  is  proved  for  a  triangle. 

To  prove  (2.11)  in  the  second  case  one  can  proceed  similarly,  using  that  the  estimate  is 
invariant  under  affine  transforms  and  most  importantly  that  T  is  an  SLR-triangulation  (see 
§2.1).  We  omit  the  details.  □ 

Proof  of  Lemma  2.12.  Let  S  G  Sm  be  an  element  of  best  L^-approximation  to  /  on  0A 
from  Sm .  Using  Lemma  2.7,(c)  and  Holder’s  inequality,  we  obtain 

s  •••  •  <  *  ?  . 

<  cSi(/),  +  c|!2Ar/’'-I*‘||S-S||M[!a) 

<  <=Sa(/)„  +  c|SJi|'/»-i/,.(||/  _  s||MOil  +  ||/  -  S||t,(0il) 

<  c§a(/)„  +  c||/-  S||i,(na)  <  cSi(/)„.  □ 

Proof  of  inequality  (3.11).  We  shall  use  the  obvious  inequality 

ciabs~a  <  (a  +  b)s ,  if  0  <  a  <  s  and  a,  b  >  0,  (4-15) 
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which  is  immediate  from  ( a/b)a  <  ( a/b  +  1)"  <  ( a/b  +  l)s.  Now,  set  a  :  =  1/r  —  1/p, 
s  :=  1/r  >  a,  a  :=  nxTn,  and  b  :=  Yh‘jLn+ ix]-  Applying  inequality  (4.15),  we  find 

OO  -t  i  OO  -t  /  OO  -t  / 

^=n+l  ji=n+l  ^‘=n+l 

o°  -j jT 

=  n~aaab1^T~a  <  n~a(ci  +  b )1//r  <  n_° ^ 

j=i 
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